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PREFACE 



Je ne doute pas que, pour peu qu’on regarde la progression de ces nombres, 
on ne desespere presque d’y decouvrir le moindre ordre, vu que l’irregu- 
larite de la suite des nombres premiers s’y trouve entremelee tellement, 
qu’il semblera d’abord impossible d’indiquer quelque loi ces nombres 
observent entre eux, sans qu’on sache celle des nombres premiers. 

L. Euler, Opera Omnia, ser. 1, vol. 2, pp. 244-245. 



The course of lectures on which this monograph is based has centered about 
two aspects of the distribution of primes ; aspects which are closely interrelated 
and can roughly be indicated by the headings Statistics and Almost-periodicity. 

Both of these aspects must be dismissed as mere “physics”, if it is assumed 
that the sole purpose of the analytic theory of numbers is to abschatzen. But 
Abschatzungen in themselves can never lead to any insight into the system of 
primes. For instance, about the only thing that can be prophesied from the 
upper estimates obtained thus far in Riemann’s hypothesis or in Dirichlet’s 
divisor problem is that no solution will ever be supplied by this technique. 
Actually, there is a variety of more modest problems of substantially arithmeti- 
cal interest and still of “physical” nature. Some of them turn out to be equiva- 
lent to, some others lie slightly deeper than, while still others, though fascinating 
enough, do not depend on, the prime number theorem. As illustrated by B run’s 
form of the sieve of Eratosthenes, there are no absolute standards in terms of 
which it is meaningful to state that results of the latter kind are by necessity 
of less depth than those depending on the prime number theorem. 

The first of the two “physical” aspects referred to at the beginning will appear 
on all the various levels of the relative depth just mentioned. In fact, that 
chaotic homogeneity which makes any statistics possible at all is, in its various 
forms of refinement, a principal characteristic of the asymptotic distribution of 
the primes. The ergodic behavior in question cannot easily be specified in a 
sufficiently inclusive form; mainly because the underlying probabilities, being 
asymptotic probabilities, are defined in terms of relative measures, in contra- 
distinction to the additive measures of the strictly Lebesgueian theoiy of 
absolute probabilities. 

The other aspect deals with the harmonic analysis of deviations from the 
underlying average state. In this regard, there will be developed a Fourier 
theory for “arbitrary” sequences of numbers; a theory which at first glance 
does not appear to involve the distribution of the primes. Actually, the latter 
will always lurk in the background of the resulting criteria for almost-periodicity. 

iii 
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PREFACE 



To quote Plotinus in a context which he could not fully have appreciated, 
at 5e appovicu at ev rats (poovals cupavels ras (pavepas iroLrjaaaaL (Enneads, I, vi, 3). 
It is only another manifestation of this situation that Riemann’s hypothesis 
is equivalent to the assumption that there exists an harmonic analysis for the 
remainder term of the prime number theorem; cf. the papers referred to under 
[91] in the Bibliography. 

I wish to thank Professor E. K. Haviland for his devoted help in seeing the 
manuscript through the press, and the Waverly Press for their understanding 
cooperation. 



Oakland, Aid., August 1942 



Aurel Wintner 
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ERATOSTHENIAN SUMMATIONS 



Mean values §1- §4 

The Eratosthenian matrix §5- §7 

The pair of Eratosthenian summation methods §8— §12 

Euler products §13— §17 

Tauberian theorems for the Eratosthenian summation methods §18— §24 



Mean Values 
1. Unless stated otherwise, let 

(1) e-z. 

1 

Let n , m, k denote variable positive integers. Correspondingly, by a function 
g = g {n) of n will be meant a sequence of numbers gr(l), gr(2), • • • . 

For every function gin), let a “norm” be defined by 

(2) N(g) = lim sup - E I g(m) \ , 

n~* CO 'H/ 771 = 1 

where N(g) = x is allowed. On the other hand, let the “mean” 

(3) M(g) = lim - E g(m) 

n — * sc n m= 1 

be considered as undefined unless it exists as a finite limit. 

As pointed out by Kronecker [56], 

(4) il 1(g) = 0 if S g(n)/n converges 

(but not conversely). In fact, if a n = g(n)/n , the assertion (4) is that the 
convergence of the series 2a n implies that a\ + 2 a 2 + ••• + na n = o{n). 
But the truth of this assertion follows by observing that the ratio of 
a\ + 2 a 2 + • • • + na n to n is identical with the deviation of the n-th partial 
sum of S a n from the n-th (C, l)-sum of S a„ . 

It is also seen that 

(5) g{n) = o(n) if M (g) exists 
(but not conversely). In fact, (3) is equivalent to 

n 

(6) E g(m) = nM(g) + o(n). 

771= 1 

But if n is replaced by n — 1 in (6), then (5) follows by subtraction. 

It is clear from (6) and (5) that, if ilf(^) exists, both 

XI g( m ) = OxM(g) + o(x) and XI g( m ) = OxM(g) + o(x) as x — > oo 

771 <dX TTl^dx 
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hold not only for d = 1 but for every fixed 0 (where 0 ^ 6 ^ 1). This means 
that the mean of g(n) exists if and only if there exists a number, M(g ), satisfying 

(7) - E F (- S ) d( m ) M(s) [ F(t) dt as n —> x 

Tl m= 1 \7h / J 0 

for the function F(t ), 0 ^ t ^ 1, that is 0 or 1 according as t is not or is between 
0 and 6 , where t = 6 is arbitrarily fixed and both choices F{6) = 0, F(6) = 1 
are allowed. It follows that if M ( g ) exists, then (7) holds for every step function 
F (having a finite number of jumps). In fact, it is clear that if (7) holds for 
F = Fi and for F = F 2 , then it holds for F = C\F\ + c 2 F 2 also, where , c 2 are 
arbitrary constants. 



2. In order to include in (7) functions F that are not step functions, it is 
necessary to subject the function g(n) to suitable Tauberian restrictions of the 
type considered by Axer [1]. The simplest restriction of this type is the assump- 
tion that g(n) has a finite norm (2). In fact, it is clear that if A r (g) < *>, and 
if (7) holds for F = Fi , F 2 , • • • , where {F k (t ) } is a uniformly convergent 
sequence of bounded functions, then (7) holds for F = lim F k also. Hence, 
if N(g) < oo , and if M (g) exists, then (7) holds for every F that is uniform limit 
of step functions; for instance, for every F that either is continuous or has a 
finite number of discontinuities of the first kind. 

The latter condition is not satisfied by* 



(8) F{t) = r 1 - [r 1 ], (0 < 1; Fi 0) = 0, say), 

an E-integrable function (of unbounded variation) which underlies the divisor 
problem of Dirichlet ([18]; cf. [19]). However, the condition is obviously satis- 
fied by each of the functions F 1 , F 2 , • • • , if F k denotes the function attaining the 
value 0 or F according as 0 ^ t < kT 1 or AT 1 ^ t ^ 1, where F is the function 
(8). Furthermore, 0 ^ F k {t) ^ F{t) ^ 1 for every t and so, if e > 0, 



- z 

Tl m / n < e 



*■*(") »<” )S S 



Z 

m<»e 



g{m) | g e l.u.b. \ E I g(m) \ 

1 ^ x < oo X m < x 



But the least upper bound multiplying e is independent of k , ?i, e, and is finite, 
since N(g) < °o by assumption. It follows therefore by letting k — -» oc and 
e — > 0 (in this order), that (7), being true for every F k , must hold for the limit 
function (8) also. 

It follows that, if M(g) exists and N(g) < =c, then 
(9) - E {- - g(m) —> T'(2)M(g) as n ^ . 

Tl m= 1 \7Yl [_^J J 

In fact, substitution of (8) into (7) gives (9), since the integral on the right of 
(7) becomes 1 — C, where C is the Euler-Mascheroni constant; so that 1 — C = 

l + r'(i) = r'(2). 



* The bracket refers to the greatest integer. 
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The above restriction, Nig) < go , is more inclusive than the standard Taube- 
rian condition, g(n) = 0 L ( 1), since the case of unilateral boundedness can be 
reduced to the case Nig) < °c (but not conversely). In fact, since (9) is re- 
duced to the definition of r'(2) when g(n) = 1 for every ft, it is clear that if 

(9) holds for a function g of ft , then it holds for the function g + const, also. 
Hence, gin) = 0 L {1) can be reduced to g(n) ^ 0. But then the existence of 
M(g) implies that Nig) < oo ; cf. (2) and (3). 

3. If Fit), 0 ^ t ^ 1, is an L-integrable function, it may be said to possess 
a density function, say Git), 0 ^ t ^ 1, if 

t-\-h 

(10) J {F(u) — Git)} du = o(\h\) as h -> 0 

holds for every t (according to Lebesgue, not only (10) but also the corresponding 
relation for the absolute value of the integrand is always true for almost all t). 
The notion which parallels the density function in case F{t) is replaced by a 
function /(ft) of the positive integer n goes back to §301-§304 of the Disquisi- 
tiones Arithmeticae [26]. In fact, the considerations of Gauss lead to the idea 
of a density function, say gin), to be defined (if it exists) as follows: 

n+ra 

(11) 2 [fi]) — g(n)\ = o{m) as m — > oo 

l=n + 1 

holds whenever m/n — » 0, where n = n(m) is any fixed function of m. Needless 
to say, g(n) is not uniquely determined by f(n), since g(n) can be replaced by 
gin) + h(n) whenever h{ri) 0 as n — > qo . 

In order to make possible a direct comparison of this notion with the idea 
of a mean M (/) of f{n), only the case will be considered in which g(n) is (or, 
more precisely, can be chosen as) independent of n. Let then the constant 
gin) = gr(l) be denoted by G(J) and called the Gaussian density of f(n). Thus, if 

(12) s(n, m) = fin + 1) + • • • + fin + m), 

it is seen from (11) that the existence of a Gaussian density for /(ft) means the 
existence of a number Gif) having the property that 

(13) sin, m)/m — > (?(/) as m — » co 

holds for any function n = n(m) satisfying m = o(ft) as m — » oo . 

The mutual relationship of M if) and Gif) is often misunderstood in the litera- 
ture. Bachmann’s comments in his Analytische Zahlentheorie (pp. 399-400) 
are misleading, to say the least. The corresponding statements in Kronecker’s 
Vorlesungen (pp. 314-372) are provably false, as will be seen at the end of §4 
below. It must, however, be said that the careful statements in Kronecker’s 
original paper [56] were quite correct, and so the mistake might have been 
introduced by Hensel, the editor of the lectures. 
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4. The actual situation is as follows: If/(n) possesses a Gaussian density, then 
(i) fin) = Oil) and (ii) Gif) = M(f). 

In (ii), the existence of M (/) is part of the statement, the only assumption 
being the existence of Gif). On the other hand, obvious examples show that 
(i) and the existence of M if) together do not imply the existence of Gif). In 
this connection, cf. Hartman and Wintner [43]. 

In view of the definition of Gif), as given by and after (13), the assumption 
is that, if mi < m 2 < • • • and n± < n 2 < • • ■ , then 

(14) sin k , m k )/m k —>(?(/) as k — > cc whenever m k /n k — > 0. 

Let the sequence m[ , m 2 , • • • be defined, in terms of a given sequence 
mi , m 2 , • • • , by the recursion formula m k = m k - 1 + m k , where mo = 0. Sup- 
pose that the sequence mi , m 2 , • • • is so chosen that m k /m k — > 0 as k — > oo . 
Then (14) implies that s(m&_i , m k )/m k — > (?(/). It follows therefore from 
Cauchy’s lemma on averages, that the ratio of s(mo , mi) + • • • + s(m k - 1 , m&) 
to mi + • • • + also tends to Gif). Since the denominator and the numerator 
of this ratio are respectively identical with m k and with the sum (12) belonging 
to n = 0, m = m k , it follows that s(0, m k )/m k — > Gif) as k — » oc . On the other 
hand, it is seen from (12) and (3) that the existence of M(f) means 
that s(0, m)/m — > M (/) holds as m runs through all positive integers. Since 
there are m k positive integers m between m = m k - 1 + 1 and m = m k , and since 
m k /m k — > 0 as A; — > °c , it now follows easily that, in order to prove (ii), it is 
sufficient to prove (i). But it is readily seen from (12) that if (i) is false, then 
it is possible to construct two sequences U\ < n 2 < • • • , mi < m 2 < • • • which 
satisfy m k /n k — > 0 as k — » cc but are such that the limit of s{n k , m k )/m k as k — > oo 
does not exist. However, this contradicts (14). 

As an illustration, let/(n) be 1 or 0 according as n is or is not a prime. Then 
M if) exists and vanishes, since the number of primes not exceeding m is s( 0, m) = 
o(m). On the other hand, not even the prime number theorem, s(l, m) ~ 
m/ log m (or, for that matter, its remainder term supplied by Riemann’s hy- 
pothesis) assures the existence of Gif). That Gif) happens to exist, follows 
from Brun’s form of the sieve of Eratosthenes ([5]; cf. Hardy and Littlewood 
[39], p. 69). In fact, Brun’s elementary approach assures the existence of a 
constant C satisfying s(n, m) < Cm/log (m + 1). But all that is needed for the 
existence (and the vanishing) of Gif) is that s(n, m) = oim) should hold uni- 
formly in n. 

Suppose, however, that the preceding fin) is replaced by the function which 
is log n or 0 according as n is or is not a prime. Then the prime number theorem 
states that M if) exists and has the value 1. It follows therefore from (ii) that 
Gf) = 1, if Gif) exists. But the number of integers n satisfying fin) ^ 0 
and n ^ m is the preceding s(0, m) = o(m), which implies that Gif) = 0, if 
Gf) exists. Actually, the necessary condition (i) is violated. 
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The Eratosthenian Matrix 

5. If either of two arbitrary functions, say / m f(n) and/' = fin), of the 
positive integer n is given, the other function is uniquely determined by the 
assignment 

(15) /(n) = X /'(<*), (» = 1,2, •••) 

d, | n 

(the summation index, d , runs through all divisors of n). In fact, if (15) is 
thought of as a linear transformation 

00 

( 16 ) f(n) = X e nm /'(ra), 

m = 1 

then those elements of the infinite square-matrix ( e nm ) which are situated above 
its principal diagonal all vanish, since d \ n is impossible when d > n, and so 
there exists a unique inverse matrix (e nm ) -1 , since e nn ^ 0 in view of n | n. 

It is clear from (15) that, if p is a prime, then 

(17) f{p k ) = Hf'iP 1 ), hence f'(p k ) = f{p k ) - f{p k ~ l ), 

2-0 

where k = 1,2, • • • and 

( 18 ) /( 1 ) = /'( 1 ). 

However, the values of the function can be assigned arbitrarily for every n , and 
(17), (18) do not determine the function values for any n divisible by more than 
one prime. 

Since (16) is identical with (15), every e nm is either 0 or 1, i.e., of the form 
\ d= J, where the sign is a single-valued function of n and m together. According 
to (15), the actual determination of this function appears to require the knowl- 
edge of all the divisibility properties of all the integers. Fortunately, this 
monstrous task can be avoided by approaching the matrix ( e nm ) vertically, and 
not, as in (15), horizontally. In fact, the function e nm = J =t \ can be described 
as follows: The elements of the m-th column of the infinite square-matrix 
(enm) form that periodic sequence, of period m, in which m — 1 zeros are followed 
by a single 1. Correspondingly, (15) is equivalent to 

(19) x/(m) = x = x rsvw. 

ra= 1 m= 1 |_^_1 m=l [_ 

where [x] denotes the greatest integer not exceeding x. 

The actual inventor of the linear transformation (15), that is, of the matrix 
(e nm ), was neither Mobius [64], nor Dedekind [15] and Liouville [62], but Eratos- 
thenes [77], [65] (needless to say, none of them used the terminology of linear 
transformations or matrices). In fact, the possibility of constructing the matrix 
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in terms of its periodic columns is a mere restatement of the proof of the sieve 
of Eratosthenes * An equivalent description of the periodic structure is sup- 
plied by the sequences which de Polignac [67] calls diatomic ( 8ta re/jLvw). As 
pointed out recently ([89], p. 147), even the group of the (bounded) non-singular 
Z)-matrices of Toeplitz [80] can be thought of as a representation of the sieve 
process. 

The Eratosthenian way of reading (15) corresponds to the replacement of the 
/2-integral by the L-integral and is therefore indispensable in questions relating 
to the distribution of the values / attained by the function. The simplest facts 
in this direction depend on the inequality 



( 20 ) 



| Z f(m) - 

i m=l 



n 



n 



Z f'(m)/m 

m = 1 



n 



^ z 



I f(m) I, 



which is obvious from (19), if, j [x] — x | <1 is applied to x = n/m. 

It is easy to extend (19) to multilinear forms. For instance, if {n, m] de- 
notes the least common multiple of n and m, and a the complex conjugate of a, 
thenf 



(21) 



Z I Km) | 2 = 



m= 1 



oo 



z 




- Z Z 

[m, l } ^ n 



(since [ ] = 0 when {m, 1} > n). In fact, it is seen from (15) that the simple 
sum on the left of (21) is identical with 



Z Z fW(c), 

d\n c\n 



i.e. Z Z *S( n)f'(d)]'(c ), 



* The construction of the Eratosthenian matrix ( e nm ) is independent both of the unique- 
ness of the factorization of n into prime powers and of the existence of an infinity of primes. 
Hence, both of these theorems of Euclid must be provable on the basis of a sole “graph”. 
In fact, all the divisibility properties of all the integers are expressible in terms of the graph 
consisting of the lower right quarter of an infinite planar lattice in which every m-th, and 
no other, rectangle of the m-th column is marked by an asterisk. For instance, n is a prime 
if and only if the n-th row of the graph contains exactly two asterisks. This 'particular 
application of the graph is the theorem of Eratosthenes; his proof, which is not given by 
Nichomachos [77], [65], depends on the whole of the periodic structure. 

t If /'(ft) = 0 for every n not contained in a sequence of integers ki , k 2 , ■ • • which are 
pairwise relatively prime, then, since {m, m] = m, and since fm, 1} — ml when m and l are 
relatively prime, the identity (21) is reduced to 



y, /(m ) 2 = 



m = l 




/'(m ) 2 + 







if / is real-valued. A revealing application of the latter relation was given by Tur£n [82]. 
In his case, ki , k 2 , • • • is the sequence of all primes. 

It is easy to extend Turin’s results to the case of an arbitrary sequence ki , k 2 , 
satisfying (ki , kf) = 1 for i ^ j. A corresponding remark holds for the results on additive 
functions, referred to in §67 below. 



ERATOSTHENIAN SUMMATIONS 



7 



where tv c d (n) denotes, for a given pair of positive integers d , c, the number of 
those positive integers not exceeding n that are divisible by d and by c , that is, 
by {d, c}. Since tv c d (n) is the integral part of the quotient n/{d, c}, the identity 

(21) follows. 

6. Let n(n) denote the n-th element in the first column of the matrix (e nm ) -1 , 
that is, of the matrix inverse to the matrix of (16). This means that, in virtue 
of the one-to-one correspondence (15) between / and /', the function \x is defined 
as follows: 

(22) /' = /x if /( 1) = 1 and f(n) = 0 for n ^ 1. 

More generally, if m is fixed, and if the function of n representing the n-th ele- 
ment in the m-th column of ( e nm )~ l is called /'(n), then the corresponding func- 
tion /(n) is 0 or 1 according as n m or n = m. Hence it is clear from the 
periodic structure of (e nm ) that (e nm ) -1 is the matrix in which the n-th 1 in the 
m-th column of (e nm ) is replaced by ju(n); it being understood that the zeros of 
(e n m) are zeros of (€ nm ) -1 also. In other words, the inverse matrix can be repre- 
sented in terms of the elements, n(ri), of its first column, as follows: 

(23) (enm) -1 = (ix(n/m)), where n(x) = 0 unless x = [x] 

(that is, id(n/m) = 0 unless m | n, where / is a fraction sign). 

Since (15) is identical with (16), it follows from (23) that the inverse of the 
linear substitution (15) is 

(24) fin) = E n (n/d)f(d), (n = 1,2, ■■■). 

d | n 

In particular, from (22), 

(24 bis) 1 = m( 1) and 0 = E n(d) if n > 1, 

d | n 

which means that the function /u(n) defined by (22) is the function /jl(u ) of 
Mobius. Correspondingly, the inversion of (15) is what Sylvester [74] called 
a theorem in logic (viz., an identity in Boolean algebra). 

Another way of identifying (22) with the explicit definition of n(n) follows 
by writing the term /'(d) of the sum (15) in the form 1 ./'(d), and observing then 
that, since 2 l.n~ s = f(s), the definition (15) is formally equivalent to 

(25) E/(w)n _s = f(s) Yjf'(n)n ~ s ; 

cf. (1). However, for reasons which will become apparent in §84, Dirichlet 
series will hardly be used in the sequel. 

It should be mentioned for later reference that the function usually denoted 
by A (n), that is, the coefficient of n~ s in the Dirichlet series of the logarithmic 
derivative of l/f(s), can, by (15), be defined as follows: 



(26) 



/'(n) = A (n) if f(n) = log n. 
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7 . It is easily verified from (15) that (19) can be generalized to 

(27) E/M = E [n/m]f'(m) + E F'(n/m) — [n/l\F'(l), 

m= 1 m= 1 m= 1 

where l is any positive integer not exceeding n, the signs / are fraction signs, 
the brackets refer to greatest integers (so that the second summation on the 
right is to be arrested atm = [n/l]), finally F'(n) denotes the sum function 

(28) F'(n) = 

m= 1 

(it being understood that F'{x) = F'([x]) when x is not an integer) . The identity 
(27), which goes back to Dirichlet [18], [19], leads to easy and unified proofs, 
and at the same time to generalizations, of a whole class of classical estimates. 

In order to see this, suppose first that/'(n) = 0(1), and choose l = [ri*] in 
(27). Then omission of the brackets in the first sum on the right of (27) intro- 
duces the error [?i 5 ]0(l) = 0(jF). Hence 

(291) E /M = n E f'(m)/m + E F'(n/m) - n h F’{n h ) + 0(n*). 

m= 1 m= 1 m= 1 

For instance, if f'in) = 1 for every n, then f(n) is, by (15), the number, d(n), 
of all divisors of n, and (28) is reduced to F'(n) = n. Hence, (29i) becomes 

^2 d(m) =2 n ^ — + 0(n h ) = 2n(\og n h + C) — n + 0(rF), 

m= 1 m= 1 

which is Dirichlet ? s estimate in his divisor problem. 

Next, suppose that F'(n) = 0(1). Then it is clear from (28) that the assump- 
tion, f'(n) = 0(1), of (29i) is satisfied. But (29i) now implies that 

n oo 

( 292 ) E/( m ) = f{m)/m + 0{n h ) = «E + 0{n h ). 

vi= 1 m= 1 m= 1 

In fact, it is seen from (28) by partial summation that, since F'(n) = 0(1), the 
infinite series 2/' (m)/m is convergent and is approximated by its n- th partial 
sum with an error OirT 1 ). 

For instance, if f(n) is ( — l) Kn_1) .4 or 0 according as n is odd or even, then 
fin) is, by (15), the number, r 2 (n), of the representations of n as a sum of two 
squares (cf. §71 below). Hence, (29 2 ) is reduced to 

r 2 (m) = 4n XI ( — l) 7n_1 /(2m — 1) + 0(?T) = irn + 0(w 5 ), 

m= 1 m= 1 

which is Gauss 7 estimate in his lattice problem. 

The assumption of (292) was the boundedness of the partial sums, (28), of the 
series 2 f' (m) . Suppose now that this series is convergent. Then it is clear 
from the passage from (29i) to (29 2 ), that (29 2 ) can be refined to 

n n oo 

(29 3 ) E/M = n E + o(ri k ) = n E f(m)/m + o(n J ). 

m= 1 m= 1 m= 1 
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For instance, if f(n) = \ n(n) |, the convergence of 2/'(n) is readily seen to 
mean the convergence of 2 n(n)/n (cf. the first part of §53 below), and so (29 3 ) 
represents that restatement of the prime number theorem to which Landau 
devoted pp. 604-609 of his Handbuch. 

The assumption of (29 3 ) neither implies, nor is implied by, the estimate 
f'(n) = 0(l/n); an estimate which does not even entail the estimate F'(n) = 
0(1), supposed by (29 2 ). However, if f f (n) = 0(l/n), it is clear from (20) that 

n n n oo 

(29i) E f(m) = «y + ^0(l/m) = + 0(log n). 

m=l m= 1 m= 1 m= 1 

Since /'(n) = 0(l/n), it is seen from (294) by partial summation that 

n 00 

(294 bis) X mf(m) = hn X f r (m)/m + nO( log n). 

m= 1 m= 1 

On the other hand, since f'(n) = 0(l/n), partial summation of (294 bis) gives 
only O(log 2 n), instead of 0( log n), in (294). Thus the true estimate belonging 
to the assumption /'(n) = 0(1 /n) is (294), and not (294 bis). 

For instance, if f'(n) = 1/n or f'(n) = n(n)/n , then (15) shows that/(n) = 
a(n)/n and f(n) = 0(n)/n respectively, where a {n) denotes the sum of the divi- 
sors of n and 0(n) is Euler^s function. Thus (294 bis) is reduced to 

n n 

E a{m) = |w 2 f( 2) + nO{ log n) and y 4>(m) = \n/$(2) + n0{ log n), 

m= 1 m= 1 

the standard estimates of Dirichlet and Mertens. In contrast to the preceding 
estimates of classical sum functions, neither of the latter estimates has ever 
been improved. On the other hand, it has never been proved in either case that 
the 0( log n) is not o(log n). 



The Pair of Eratosthenian Summation Methods 



8. Let f{ri) be arbitrary. Since [n/m]/n — > 1/m as n — > 00 if m is fixed, 
(19) implies the formal connection 



(30) 



lim - X f( m ) = - X 



n 

m 



f'(rn) = y 

m=l 771 



if the limit process is carried out term-by-term. 

The evaluation of M (/) resulting from (3), (30) in the series form 2 (m ) /m 

appears to have been known to Gauss, who has used it freely (for instance, in 
his celebrated fragment [27] which, having been posthumous, was not at the 
disposal of the parallel and more inclusive work of Dirichlet; in fact, Dirichlet 
[16] knew only of what the Disquisitiones Arithmeticae [26] stated without proof 
concerning certain averages). 

Correspondingly, the sieve process that Dedekind [14], in his commentaries 
to Gauss* fragment, calls the successive elimination of primes, is nothing but 
(15), followed by the formal limit process applied in (30). Dedekind also 
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observes that the elimination of all the primes, that is, the limit process cor- 
responding to the formal step 



(31) 



lim Z-f-l/'W =t lim- 

n — >oo m= 1 71 |_W_| m= 1 n—*oo 71 




/'(m) 



in (30), needs legitimization. In the particular cases considered by Gauss, 
cases in which f'(n) is substantially a non-principal character, Dedekind justifies 
this step by a partial summation (actually, the assumption of (292) is satisfied 
in these cases). 

Cesaro has rediscovered the Gaussian principle and, being free of Dedekindian 
inhibitions as to (31), has applied (30) extensively; cf. [9], [10]. 

According to (3), the claim made by (30) is that M(f) = 2/'(m)/m holds 
whenever either side of the latter equation exists. This claim is three-fold: 

(I) M(f) 9 ^ 2 is impossible if the mean M(f) exists and the series 
2/'(m)/m is convergent; 

(II) M (/) must exist if 2/'(m)/m converges; 

(III) 2 must converge if M(f) exists. 

The object of this chapter is to show that, though (I) happens to be true, 
both (II) and (III) are false. It will be seen in §13 and §16 that even the truth 
of (I) is due only to a fortunate coincidence. 



9 . If the/(n) and/'(n) are replaced by the functions 
(32) F(n) = - X fW and F*(n) = X , 

71 m= 1 m= 1 771 

then the relation (15) defining the connection between /(n) and f'(n) is equiva- 
lent to 

"1 771 + l] 






(33) F(n) = ± ([?]«- [-All 

m= 1 71 \_77l + 1J 

In fact, substitution of (32) into (19) gives 

F{n ) = - Z I” — j m{F*(m) - F*{m - 1)!, 

71 m= 1 J 



r 7i ~ 

a relation identical with (33), since — - — - 

+ i. 

The following considerations depend on t 
formation, 



= 0 and F*( 0) = 0, by (32). 
le fact that (33) is a linear trans- 



(34) F(n) = X anm F*(m) or F(n) = X «nm^*(m), 

m= 1 m = 1 

belonging to the matrix formed by the absolute constants 

771 + 1 



(35) 



_ [ ti~] 77i r 7i n 

anm \_77l\ 71 \_77l + lj 



{a nm = 0 for m > n). 



71 
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According to (32), the existence of M (/) and the convergence of 2/'(n)/n are 
equivalent to the existence of the limits, F(oo) and F*( °o), respectively. It 
follows therefore from (34) that the investigation of any connection between 
the existence of M(f) and the convergence of 2f'(n)/n must be reducible, at 
least in principle, to the general theory of linear summation methods. 

The simplest result attainable by this approach is as follows: 

(I) If M{f) exists and if 2/' (n)/n is convergent , then M(f) = 2 f'(n)/n . 

In view of the remark following (35), the assertion of (I) is that the summation 
process (34) defined by the matrix (35) satisfies the conditions of consistency. 
The latter are known to be two-fold, namely 

n 

(i) lim ^ a nm = 1; (ii) lim a nm = 0 for m = 1, 2, • • • . 

n — >oo m= 1 n—+oo 

But it is clear from (35) that the sum occurring in (i) is 1 for every n , and that 

(ii) also is satisfied, since [x]/x — > 1 as x — > oo . 

10. On p. 115 of his book, Cesaro [9] attempted a short proof of a general 
theorem which, in terms of the notations (3) and (15), can be formulated as 
follows: If the series 2/'(n)/n is convergent and if /'(l) + • • • + f'(n) = o(n), 
then M(f) exists. However, his proof of the theorem is readily seen to be 
erroneous. It turns out that the theorem itself is false. 

First, the assumptions of the theorem are tautological, since, according to 
(4), the o-condition of Cesaro is always implied by the convergence of 2/'(n)/n. 
Hence, what the theorem actually states is that the convergence of 2/'(n)/n is 
sufficient for the existence of M(f). In view of (I) and of the remark following 
(35), this can be expressed by saying that the summation process (34) defined 
by the matrix (35) is regular in the sense of the theory of Toeplitz ([78]; cf. 
Schur [70]). But then the sums corresponding to Lebesgue's constants in the 
theory of Fourier series must be bounded. In other words, not only the condi- 
tions (i), (ii) of (I) but also the third condition of Toeplitz, namely 

n 

(iii) E|anm|=0(l) as n— >°o, 

m= 1 

must be satisfied by (35). Conversely, it is clear from (I) that (iii) is not only 
necessary but sufficient as well for the truth of Cesaro’s theorem. 

It will now be shown that (iii) is violated by (35). 

(II) The convergence of 2/'(n)/n does not imply the existence of M(f). 

Actually, it turns out that the sum (iii) belonging to (35) exceeds a constant 

multiple of log n. Incidentally, it will be seen from the proof that (iii), just as 
in the theory of Fourier series, becomes true if 0(1) is replaced by 0(log n). 

Let n have a fixed value. It is clear that, if k is a positive integer not greater 
than n, a positive integer m satisfies the pair of conditions 



12 



ERATOSTHENIAN AVERAGES 



if and only if it fulfills both inequalities 



n 

F+T 



. ^ n 

< m S — 1. 

k 



Hence, if these inequalities are satisfied, then (35) can simply be written as 



= k 



m 



m + 1 



n 



which implies that \ a nk \ = k/n. Accordingly, if l nk denotes the number of the 
integers m satisfying both inequalities which belong to a given k and to the 
fixed n, then, since k = 1,2, • • • , n, 

n n 

^ y | | = ^ ^ Inkk/fl ^ ^ I Ink k/n. 

m = 1 k=l k = 1 



It is understood that the first of the two signs ^ is introduced by the omission 
of those summation indices which do not satisfy any of the pair of inequalities 
belonging to the various values of k and to the fixed n (and that the summation 
limit n * refers to k = [n*]). But it is clear from the definition of l n k that 



Ink = T' 1 



n 



— 1 = 



- 2 , 



k k + 1 k(k + 1) 

which implies that l n kk/n ^ (k + 1) _1 — 2 k/n. Hence 

it, \anm \ ^ ^ {(k + l) -1 — 2 k/n} > k~ l — 2 X) k/n, 



and so, since 2^ k = 0(x 2 ) 



asx-> oo , 



> £ k 1 — 0(ri)/n = log (n*) + 0(1) 



as n — > oo . Since this contradicts (iii), the proof of (II) is complete. 



11. Since (35) implies that a nn = 1, the transformation (34) of F* into F 
has a unique inverse, say 

n oo 

(36) F*(n) = y b nm F(m) or F*(n) - X b nm F(m), 

m = 1 m=l 

where b nm = 0 for m > n. Corresponding to (35), the infinite matrix (b nm ) = 
(< a nm )~ l consists of certain absolute constants. Some of the b nm are negative, 
as are, by (35), some of the a nm ; so that neither of the summation methods 
(34), (36) belongs to “weights”. 

It will now be shown that what (II) states for (34) holds for (36) also. 

(Ill) The existence of M(f) does not imply the convergence of 2 f'(n)/n . 

In order to prove (III), it would be possible to proceed in the same way as in 
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§10, that is, to show that the Lebesgue-Toeplitz condition (iii), §10 remains 
violated if a nm is replaced by b nm . However, it is seen from (36), (32) and (24) 
that the explicit representation of b nm involves awkward combinations of the 
Mobius function. It is therefore convenient to carry out in another form the 
considerations on which the Lebesgue-Toeplitz criterion depends. To this 
end, it is sufficient to observe that, according to (4), the negation (III) is a 
corollary of the first part of the following statement: 

(III*) The convergence of 2/(n)/n is insufficient and unnecessary for the con- 
vergence of 2/' (n)/n. 

The truth of the second part of (III*), which is not needed for (III), follows 
by choosing f'(n) = 0 for every n > 1, since /(n) then is the constant /' (1 ) , by 
(15). In order to prove the first part of (III*), it is sufficient to observe that, 
according to (25), the series H,f'(n)/n is the Dirichlet product of the series 
2 u( n )/ n and 2f(n)/n (in fact, (25) and the definition (22) imply that the 
Dirichlet series of l/f(s) is 2 y(n)n~ s ). Thus the assertion of the first part of 
(III*) is that there exist convergent numerical series whose Dirichlet product 
with the series 2 pt(n)/n is a divergent series. But 2 yu(n)/n is not absolutely 
convergent, since 2 | ^(n) \/n ^ 2 | u(p) |/p, where p(p) = —1 for every prime p. 
Hence, the existence of numerical series of the type required follows from the 
converse of the multiplication theorem of Stieltjes; a converse established by 
Schur [70] exactly along the lines of the Lebesgue-Toeplitz principle. 

12. It is seen from the remarks following (35), that (III), (II) and (I) together 
can be expressed as follows: The linear summation methods defined by the 
reciprocal transformations (34), (36) belonging to (35) are incomparable , though 
consistent. 

The standard instances of consistent but incomparable summation methods 
are supplied by the processes of Cesaro-Holder and of Euler or of Borel. How- 
ever, the present instance of incomparable consistency is more striking, since 
the processes (34) and (36), being reciprocal, form an involutory pair. However, 
as pointed out at the beginning of §11, the “ weights” are now negative in part. 

It seems to be of particular interest that the pathological situation represented 
by the incomparable character of the consistent involution (34), (36) is inherent 
in the sieve of Eratosthenes (cf. (30), (31) and §5). In fact, it is clear from (32) 
that the reciprocal mates (34), (36) represent a mere transcription of the classical 
pair (15), (24). Thus the “pathology” cannot now be excused by the “artificial” 
nature of the analytic smoothing processes of Euler or of Borel. 

Euler Products 

13 . Most of the functions f{n) considered in the classical literature (cf . §7— §8) 
are multiplicative, that is to say such that, if (n \ , n 2 ) denotes the greatest com- 
mon divisor of n\ and n 2 , 



(37) 



fin x nf) = f(ni)f(n 2 ) if (n x , n 2 ) = 1. 
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This implies that, if the trivial case/(l) = /( 2) = • • • = 0 is excluded, 

(38) /( 1) = 1; hence /'( 1) = l,by (18). 

According to (37), a multiplicative fin) is uniquely determined for every n 
by an arbitrary assignment of a double sequence of values f(p k ) , where p and k 
range over all primes and over all positive integers respectively. It is easily 
verified from (15) that 

(39) / is multiplicative if and only if /' is. 

Thus (15) or (24) is now equivalent to the identity 

(40) f(p k ) = Z/V), i.e. f(p k ) =f(p k ) - /(p*- 1 ), k > 0, 

J=0 

(which, according to (17), is true, but incomplete, if (37) is omitted). 

If fin) is multiplicative, the same is true, by (39), of f'(n)/n. Hence, if 
2 f'{n)/n is convergent, a formal application of Euler’s factorization gives 

(41) Z f(n)/n = II jl + Z f(p)/p k \; 

n = 1 p ( k—1 J 

cf . (50) below. But then (40) and (38) show that the Gaussian principle M (J) = 
2 f'(n)/n (cf. §8) can be written in the form 

(42) M(f) = n Z {f(p k ) - /(/ _1 )}//, /(l) = 1 , if Kp- 1 ) = o. 

p k=0 

(42) admits of an interpretation in terms of the product rule of “independent 
probabilities” (cf. Wintner [92]). In fact, let f g (n) denote, for a fixed prime g, 
the multiplicative function for which the double sequence { {/q^)} } is given by 

(43) /*(/) = ! if and f Q (q k ) = f(q k ), 

where k = 1, 2, • • • . Then, if (42) is applied to / = f q , it follows that (42) 
itself can be written in the form 

(44) M(f) = n M(J p) . 

P 

On the other hand, from (43), (38) and (37), 

(45) f(n) = Jl fpin) for every n 

v 

(it being understood that the infinite product (45) has only a finite number of 
factors distinct from 1, if n is fixed). It is the formal parallelism between (44) 
and (45) on which certain considerations of Sylvester [73], [76] appear to depend. 

Actually, (42) presupposes not only the validity of the Gaussian principle 
(§8) in the multiplicative case, a principle disproved by (II) and (III) in the 
general case, but also the legitimacy of the Euler factorization (41). It will now 
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be shown that (42) can fail in every sense, that is, that all three statements repre- 
senting the Eulerian analogues of (I), (II), (III), §8 are false. 



14 . A realization of the first of these three contingencies can be made to 
depend on the machinery of the prime number theorem, as follows: 

The proto-Tauberian direct proof of n( 1) + • • • + pin) = o(n), where 
2 n(n)rT s = l/f(s), is based on an estimate of the function l/f(s) on a domain 
containing the line a = 1. Since the same estimate is trivial for the function 
f (s) itself and therefore for the function f (2 s — 1) as well, it is available for the 
product of l/f(s) and f(2 s — 1) also. Hence, if l/f(s) is replaced by 
f(2s — 1 )/f(s) in the proof of /x(l) + • • • + ju(n) = o(n), it follows that the sum 
of the first n coefficients in the Dirichlet series of f(2s — l)/f(s) is o(n). This 
implies that the latter Dirichlet series is convergent at s = 1, since the function 
f (2s — l)/f(s) is regular at s = 1 (Fatou-M. Riesz; cf. §24 below). 

It is convenient to restate this result in a simpler form, as follows: If gin) 

is defined by 

(46) E g(n)rT s = G(s), where (?(s) = H /l - ^ + Fl, (<r > 1), 

p l P V ) 

then 



(47) gr(l) + • • • + g(n) = o(n). 

In fact, two-fold application of f(s) = n (1 — p s ) 1 gives 

V 

/ 4 o\ f(2s — 1) _ tt f ^ ■ _P _ ^ i 

f(s) p \ p s p 2s p 3s p 4s p 5 * 



where a > 1. Clearly, the quotient of the products (46), (48) is absolutely 
convergent, and represents a non-vanishing regular function, in a half-plane 
containing the line a = 1 in its interior. In particular, the quotient has a 
Dirichlet series that is absolutely convergent at s = 1. Since the Dirichlet 
series of the function (48) is convergent at s = 1, it follows from the multiplica- 
tion theorem of Stieltjes, that the Dirichlet series of the function (46) is con- 
vergent at s = 1. In other words, 2 g{n)/n is convergent. Hence, (47) follows 
from (4). 

It should be noted for later reference that the function (2s — l)/f (s) attains 
at s = 1 the value 



(49) {r( 2 * - i )/roou = i, 

since (z — l)f(z) — > 1 as 2 — » 1 and (s — l)/(2s — 1 — 1) = 

15 . According to Euler [24], the formal factorization belonging to an arbitrary 
multiplicative function g(n) is 

E g(n) = II {1 + E g(p k ) ) • 

V \ k - 1 J 



( 50 ) 
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If g ^ 0, then (50) is valid whether 2 gin) < co or 2 g(w) = oo. Hence 

(51) E I g(n) I < 00 if and only if E E I g(v k ) I < 00 • 

p k = 1 

If 2 gr (ft) is absolutely convergent, then (50) is valid and, in addition, the 
product and each of the sums on the right of (50) are absolutely convergent. 
However, all of this can hold even if 2 <7 (ft) is divergent. This is shown by the 
example 

(52) g{p) = -1, g(p 2 ) = 1, g{p k ) =0 if Jc, > 2. 

In fact, every factor { ) in (50) is then 1, although not even gin ) — » 0 is satisfied. 
On the other hand, if 

(53) g{p k ) = (-irV* 

then 2 < 7 (ft) is convergent, since gin) = ( — 1 ) n_ 1 ft~*, although the product (50) 
becomes 



v > 2 



E2 H n <1 + e?> “ 1 = - 



The examples (52) and (53) prove the second and the third of the three nega- 
tions italicized at the end of §13; negations which can be formulated as follows: 

(IV) If g{n) is a multiplicative function, then 

(I') the convergence of 2 < 7 (ft) and the convergence of its Euler product do not 
imply the equality (50) ; 

(ID the convergence of 2 gin) does not imply the convergence of its Euler product ; 

(HI') the convergence of 2 <7 (ft) is not implied by the convergence of its Euler 
product ; 

and all of this holds even if each of the series { } on the right of (50) is required to 
converge absolutely. 

(I') can be deduced from §14 by choosing 
(54) g(p u ) = p~ k , g(p 2k ~ l ) = -p~ k 



(cf. also §55 and §64 below). In fact, it is clear from (54) that the Euler factoriza- 
tion of the Dirichlet series 2 f(n)n~ s belonging to /(ft) = ng(n) is identical with 
the product (48), where a > 1 . Hence the result of §14 means that 2 <7 (ft) 
is a convergent series. According to (49), the value represented by 2 g{n) is J, 
since the function 2 ng{n)n~ s = f( 2 s — l)/f (s) is continuous at s = 1 and so, 
by Dirichlet's analogue of Abel's continuity theorem, 

( 55 ) 2 g{n)n~ € — > 2 < 7 (ft) as e — > + 0 . 

On the other hand, (54) shows that the factor { } of the product (50) is 
1 — p~ l + p~ l — p~ 2 + p~ 2 — • • • = 1 for every p . Hence (50) claims that 

i = 1 . 
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16 . It is clear from this proof that what is responsible for the first item of (IV) 
is the failure of the analogue of the Abelian theorem (55) in the case in which, 
under the assumption that g(ri) is multiplicative, both series (55) are replaced 
by their Euler products. In fact, what is accomplished by (54) is that the 
Euler product 

(56) P(e) = n {l + E Q(v k )v~ k ^\ 

v \ *=i J 

of the Dirichlet series 2 g(n)n~ e converges both for e > 0 and for e = 0, and the 
limit P(+0) exists, but P(+0) ^ P(0). This situation was met by Hardy 
[35] in a case numerically more involved than, but otherwise of exactly the same 
type as, the example (54) ; cf . the end of §55 below. 

A possibility distinct from the one just described could be this: The Euler 
product (56) of 2 g{n)n~ e converges for both e > 0 and e = 0 but the analogue 
of the Abelian theorem fails for the reason that the limit P(+0) does not exist. 

Actually, the discrepancy between the behavior of 2 g{n)n~ s and of its Euler 
factorization P(s) is so far-reaching that P(s) need not even have an abscissa 
of convergence. In other words, if P(s) converges at s = 1, it need not converge 
when a > 1. In fact, it is possible to have for P(e) an isolated convergence 
point. For instance, if 

(57) g(p) = p, g(p 2 ) = -p 2 , g(p k ) =0 if k > 2, 

then every factor { } of (56) is 1 at e = 1, although (56) is divergent when 
either l<e^2or0^e<l. 

It would be interesting to know whether an Euler product P(s) can or cannot 
overconverge its Dirichlet series; in the sense that the product P(s) converges, 
and represents a regular function, in a half-plane a > 6, where 6 is less than the 
convergence abscissa of 2 g(n)n ~ s . 

17 . The above counter-examples depend on a retroaction of what is contri- 
buted to (50) by those prime powers, p 2 , p 3 , • • • , which are not primes, p; a 
retroaction excluded in standard situations, characterized by 

(58) II E I g(p k ) I < 00 . 

v h= 2 

If (58) is satisfied, the classical condition, (51), for the validity of (50) re- 
quires the absolute convergence of 2 g(p). This requirement is rather crude, 
since it excludes, among other things, every case in which (50) is valid but 
either the product (50) or the series 2 g(n) is conditionally convergent. A 
criterion free of this objection may be formulated as follows: 

(V) If g(n) is a multiplicative function satisfying 2 | g(p) \ 2 < oo and g(p 2 ) = 
9(V*) = • • * 0, then either the product (50) and the series 2 g(n) are convergent 
and have the same value or both the product (50) and the series 2 g{n) are divergent,, 
according as the series 2 g(p) is convergent or divergent , 
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The assumption g(p k ) = 0 for k > 1 is made only for the sake of simplicity; 
it could be replaced by a summary assumption corresponding to (58). As it 
stands, it simplifies the product (50) to n(l + a,-), if a* = g(p), where p = pi 
denotes the z-th prime. It follows therefore from the other assumption, 
2 | g(p) | 2 < oo, of (V) and from Cauchy’s criterion for the convergence of a 
product n(l + a,i), that the convergence of the product (50) is equivalent to 
the convergence of the series 2 g(p). 

Thus it is clear that, in order to prove (V), it is sufficient to verify 
that 2 | g(p) | 2 < oo implies 2' | g(ri) | < oo, where the accent refers to the 
omission of those summation indices n which are primes. But since g(p k ) = 0 
for k > 1, the indices n occurring in 2'g(n) are those square-free integers which 
are not primes. Accordingly, if (pi , • • • , p 3 ) runs through all j-tuples of 
distinct primes in the sums 

sy = 2 | gf(pi) • • • g(pj) | and t } = 2 | g(p i) • • • g{pj) | 2 , 

then the assumption and the assertion are ti < <*> and s 2 + S 3 + «4 + • • • < 00 
respectively. But s 2 + S 3 + $4 + • • • < °° is readily seen to be implied by 
h < °o. 

In fact, the Schwarz inequality shows that Sj ^ (Wy _ iY for every j > 1, and 
it is obvious that tj ^ Wy_i . Hence s 2 + s 3 + s 4 + • • • is majorized by a con- 
vergent geometric series, if ti < 1 . But the assumption ti < 1 involves no loss 
of generality. This is seen by observing that ti = 2 | g(p) | 2 , and that the 
assumption 2 | g(p) | 2 < 00 of (V) can be replaced by 2 | h(p) | 2 < 00 , if h(n) 
is a function satisfying h(p) = g(p) for all but a finite number of primes, and 
h(p ) = 0 for the remaining primes. Since the replacement of g(n) by such an 
h(n) cannot affect the truth of (V), the proof of (V) is complete. 

Tauberian Theorems for the Eratosthenian Summation Methods 

18. For an arbitrary /(n) or/'(n), the functions F(n), F*(n) defined by (32) 
represent the n - th approximations to the limits F(o o) = M(J), F*(oo) = 
2 f'(n)/n, if any. According to (34) and (36), the transition from either of the 
functions F(n), F*(n) to the other depends on a linear summation process. 
And (I) states that these two Eratosthenian summation processes are consistent. 
On the other hand, (II) and (III) state that Eratosthenian summability of either 
kind does not imply Eratosthenian summability of the other kind. Thus there 
arises the need for Tauberian restrictions under which Eratosthenian sum- 
mability of the first kind implies Eratosthenian summability of the second kind, 
and for Tauberian restrictions under which the converse is true. Such Tau- 
berian theorems form the subject of this chapter. 

It turns out that Tauberian theorems of the one kind (those leading from 
2 f'(n)/n = lim F* to M(f) = lim F) are, in the main, elementary in nature, 
while the principal Tauberian theorem of the other kind (leading from M(f) = 
lim F to 2/'(n)/n = lim F*) involves the prime number theorem. Actually, 
the latter Tauberian theorem implies an extension of the prime number theorem 
to the case of an arbitrary sequence of primes; cf. §77 below. 
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19. It is easily verified that, corresponding to the formal identity (25) be- 
tween Dirichlet series, the relation (15) defining the connection between / and/' 
is formally equivalent to the Lambertian identity 

(59) 2/(n)r n = Z/'(n)r7(l - r n ), 

where the range of the summation indices is that specified by (1). The formal 
derangements identifying the two series (59) also show that either both the 
power series (59) and the Lambert series (59) are, or neither of these series is, 
convergent (or, what is the same thing, absolutely convergent) for r < 1. 

Due to (59), both Tauberian problems of §18 become reducible to classical 
Tauberian problems in the summation theory of series. 

First, it is clear from (3) that M(J) exists if and only if the series 

(60) 2h(ri), where h(n) = fin) — fin — 1) if n > 1 and h(l) = /(l), 

is summable (C, 1). Furthermore, since the n-th partial sum of the series 

(60) is fin), 

(61) . (1 — r) -1 2 h(n)r n = E/(n)r n . 

Hence, the Abelian summability (A) of the series (60) means that the power 
series S/(n)r n converges for r < 1 and that the product 

(62) (1 — r)2/(n)r n 

tends to a limit as r — > 1. Correspondingly, since (1 — r)r n /(l — r n ) — > 1/n as 
r — » 1, the Lambertian summability (L) of a series, say of 2 gin), is defined by 
the property that the Lambert series 2 ngin)r n /il — r n ) converges for r < 1 
and that the product 

(63) (1 - r) E ng(n)r n /(\ - r n ) 
tends to a limit as r — » 1. 

The fundamental theorem on the three summation processes (C, 1), (L), (A) 
is that summability (L) implies summability (A) and is implied by summability 
(0, 1) [and, incidentally, even by summability (C, N)]. The first of these two 
facts, which is due to Hardy and Little wood [37], lies somewhat deeper than the 
prime number theorem. The second statement is elementary; cf. Hardy [34]. 
Any two of the three summation processes are consistent and inequivalent. 

20. A few elementary facts will now be collected. 

(VI) If | /.'(l) | + • • • + | fin) | = 0(n), and if M{f) exists , then 

(64) - E/(m) - E f — -r'(2 )M{f) as n -> « . 

n m = i m = i m 

According to (2), the O-assumption of (VI) can be written in the form 
Nif') < oo. Hence, the assumptions of (VI) are precisely those made for 
g = f before (9). But (19) shows that (64) is identical with the case g = f 
of (9). 
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In view of the case g = f of (4), the Axerian lemma (VI) contains the following 
corollary, which is Tauberian with reference to (II). 

CVII) If |/'(1) | + • • • + \f'(n) | = 0(n), then the convergence of 2f(ri)/n 
implies the existence of M (/) . 

(VII) is a Tauberian refinement of one part of the following criterion, both 
parts of which are trivial from (20) and (3). 

(VIII) If | /'(l) ! + •••+ | fin) | = o(n), then the convergence of 2f'(n)/?i 
is equivalent to the existence of M (/) . 

It is clear from (VIII) and from the case g = \f | of (4), that the absolute 
convergence of 2/'(n)/n is sufficient for the existence of M(f). However, it 
will be seen in §27 and §33 that the existence of M (/) is not the true consequence 
of the absolute convergence of 2/' (n)/n. 

21. The proof of the following criterion, a criterion Tauberian with reference 
to (HI), will involve the prime number theorem (cf. §19). 

(IXi) If f'{n) = Ol(1), then the existence of M(f) implies the convergence of 
2f'(n)/n. 

In fact, suppose that M(f) exists. This means that the series (60) is summable 
(C, 1), and so it is summable (A); so that the product (62) tends to a limit as 
r — > 1. It follows therefore from (59) that the product (63) belonging to g(n) = 
f(n)/n tends to a limit as r — > 1. In other words, the series 2f(n)/n is sum- 
mable (L). Consequently, it is summable (A). Hence, in order to complete 
the proof of (IXi), it is sufficient to apply to the series 2f'(n)/n an elementary 
Tauberian theorem of Hardy and Littlewood, according to which a series is 
convergent whenever it is summable (A) and such that its n-th term is 0 L (l/n) ; 
cf. Karamata [52]. 

The deep, though Abelian (that is, not Tauberian) theorem according to 
which summability (L) always suffices for summability (A) will not be used in 
the proof of the following fact, which represents a dual , instead of being a 
converse , of (IXi), and is Tauberian with reference to (II). 

(IX 2 ) If fin) = 0 L ( 1), then the convergence of 2/' (n)/n implies the existence 
of M if). 

In fact, suppose that the series 2f(n)/n is convergent. Then it is summable 
(L). This means that the product (63) belonging to gin) = fin)/n tends to a 
limit as r — > 1. It follows therefore from (59) that the product (62) tends to 
a limit as r — > 1. In other words, the series (60) is summable (A). On the 
other hand, the existence of M if) means that the series (60) is summable (C, 1). 
Hence, in order to complete the proof of (IX 2 ), it is sufficient to apply to the 
series (60) an elementary Tauberian theorem of Hardy and Littlewood, accord- 
ing to which a series is summable (C, 1 ) whenever it is summable (A ) and such 
that its n-th partial sum is 0 L (1) ; cf. Karamata [52]. 

(IX 2 ) applies in the same direction as (VII) but is independent of (VII). 
In fact, even if 0^(1) is replaced by 0(1) in (IX 2 ), all that results from (24) is 
fin) = 0 id (n)), where d(n) denotes the number of the divisors of n. Since d(l) 
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+ • • • + d(n) ~ n log n , it follows that the O-condition of (VII) is short by the 
factor log n (which, as a matter of fact, is the same log n as the one mentioned 
after the italicized result in §10). 

Incidentally, it is clear from the proof of (IX 2 ) that the assertion of (IX 2 ) 
remains true if the assumption f(n) = 0 L ( 1) is replaced by any Tauberian 
restriction under which the summability (A) of the series (60) implies its sum- 
mability (0, 1). 

22. Inasmuch as (IXi) and (IX 2 ) are not converse mates, it is natural to ask 
for Tauberian conditions under which the two Eratosthenian summation 
processes are equivalent (cf. §18). A criterion of this type runs as follows: 

(X) If f'(n) = 0(1), then the convergence o/ 2/'(n) /n is equivalent to the existence 
of M(f). 

In fact, the O-assumption of (X) implies that of (VII). Hence, one part of 
(X) follows from (VII). The other part of (X) is contained in (IXi). 

Another criterion of the same type as (X) can be formulated as follows: 

(XI) If /'^ 0, then the convergence of 2 f'(n)/n is equivalent to the existence 
of M(f). 

In fact, it is clear from (15) that / ^ 0 is implied by/' ^ 0. Hence, (XI) 
covers a domain common to (IXi) and (IX 2 ). 

It will be noted that neither (X) nor (XI) contains (VIII), although (VIII) 
is a triviality. However, it is seen from the remark made at the end of §21, 
that the Tauberian assumption of (IX 2 ) could be replaced by one concerning 
“slow oscillation” ; and a corresponding remark holds for the dual theorem, (IXi). 
In fact, the assumption /'(n) = 0 L ( 1) was used only at the end of the proof of 
(IXi); its role was that of assuring the convergence of the series 2/'(n)/n, which 
was proved to be summable (A) without the assumption /'(n) = 0 L ( 1). Hence, 
(IXi) remains true if f'(n) = O L ( 1) is replaced by any Tauberian condition in 
virtue of which the summability (A) of S/'(n)/n implies the convergence of 
2/'(n)/n. 

23. Since such a Tauberian condition is supplied by the “high-indices theorem” 
of Hardy and Littlewood (cf. Ingham [47]), there results one part of the follow- 
ing variant of (X) and (XI) : 

(XII) . If there exist a sequence {k m } and a constant X such that 

(65) f'(n) = 0 unless n = ki , fc 2 , • • * , where k m +i/k m > X > 1, 

tluen the existence of M(J) is equivalent to the convergence of 2 f f (n)/n. 

The part of (XII) remaining to be proved states that the convergence of 
2 f(ri)/n implies the existence of M(f), if (65) is satisfied. But (65) implies 
that, as n — » oo, 

T. k m = 0(n); whence 2 o(k m ) = o(n) 

k m <n k m <n 



( 66 ) 
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follows by submerging an e in the usual way. Since the convergence of 2/'(n) /n 
implies that/'(n)/n — > 0 as n — > oo, which in turn implies that /'( k m ) = o(k m ) 
as m — > oo , it follows that, as n — ■» °o , 

(67) E \f'(k m )\ = o(n), and so E l/'(0 I = o(n), 

k m <C w i=l 

by (05). Hence, in order to complete the proof of (XII), it is sufficient to show 
that, if 2 f'(n)/n is convergent and if the last o-condition, that is, | /'(l) | + • • • + 

| /'(ti) | = o(n), is satisfied, then M(J) exists. But this follows from (VIII). 

Along the lines of an arithmetical construction of Toeplitz [79], the criterion 
(XII) was recently proved for the particular case k m = where p is a fixed 
prime (van Kampen and Wintner [51]). It now turns out that the true theorem 
depends only on the lacunary structure of the function /'(n) and has therefore 
nothing to do with the arithmetic assumption of a p-adic algorithm. 

24. The criteria (VII)-(XII) supply a sufficient explanation of the reasons 
for which the counter-examples establishing (II) and (III) had to be “con- 
structed”. It is worth mentioning that there is another, namely function- 
theoretical, approach to an explanation of this necessity. 

Suppose that the signs of absolute value are omitted from the o-condition of 
(VIII). The resulting o-condition, which is identical with the tautological 
assumption of Cesaro (cf. the beginning of §10), can be written in the form 
M (/') = 0. According to (4), this is a necessary condition for the convergence 
of 2/'(n)/n. But if this trivial necessary condition is satisfied, then 2f'(n)/n 
must be convergent as soon as the function represented by the Dirichlet series 
Xf'(7i)/n in the half-plane a > 1 has not too weird a behavior near a segment 
— e < / < e of the line a = 1. For instance, it is sufficient to assume that the 
function is bounded for — e < t < e, , 1 < a < 2, and that the boundary function 
on cr = 1 (which then exists by necessity for almost all t between t = =be), is 
such as to satisfy any of the local conditions assuring the convergence of its 
Fourier series at t = 0 (cf. Karamata [53] and Ingham [46], where certain 
generalizations of this theorem of M. Riesz are proved). 

As to the dual problem, it is clear from (25) that, if the preceding boundedness 
condition is satisfied, and if and its boundary function ( — e < t < e) 

form a function continuous (and distinct from 0) at s = 1, then, since f (1 + it) ^ 
0, there exists a constant for which 2 f{n)n~ s = const. (s — l) -1 + 0(1) holds 
as s — > 1, where a > 1. It follows therefore from the theorem of Ikehara (cf. 
§83 below), that, if e can be chosen arbitrarily large, M(f) must exist as soon 
as /(ft) = 0 L { 1). Furthermore, this 0 L -condition can be replaced by a con- 
dition of slow oscillation (cf. Karamata [54]). 
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The sieve of Eratosthenes and harmonic analysis §38— §45 

Multiplicative functions §46— §54 



The Eratosthenian Matrix and Almost Periodic Functions 

25. For a given /(ft), the assignment (15) or (16) defines /'(ft) as the function 
which is transformed into /(ft) by the Eratosthenian matrix (e nm ). For a given 
/(ft), let f*(n) denote the function (if any; or, if there are several such functions, 
then one of them) which is transformed into /(ft) by the transposed matrix, 

(68) (e nm )* = (e«„), 

of (e nm ). Thus (15) is to be replaced by 

00 

(69) /(to) = X f*(mn) = X /*( w )> 

n = 1 m | n 

as seen from the periodic structure of (e nm ), described before (19). Corre- 
spondingly, the inversion, (24), of (15) becomes replaced by the Mobius in- 
version 

00 

(70) /*(to) = X n(n)f(mn) = X m( n/m)f(n ), 

n = 1 m | n 

since the transposed matrix of the matrix, (23), of (24) is the (formal) reciprocal 
of the transposed matrix of (e nm ). 

Actually, the reciprocal mates (69), (70), obtained by transposing the recip- 
rocal mates (15), (24), present analytical problems not arising for the latter 
mates. In fact, both linear transformations (15), (24) are defined everywhere 
and determine each other uniquely. On the other hand, neither of the linear 
transformations (69), (70) is meaningful without appropriate conditions of 
convergence. And even if such conditions are satisfied, additional assumptions 
are needed in order to legitimatize the derangements leading from (69) to its 
formal inverse (70) or vice versa (cf. §34, §38, §39, §55 below). As a matter 
of fact, it is a principal characteristic of the ergodic randomness induced on 
//(ft) by the distribution of the primes (cf. (iii), §38 below), that the two func- 
tions /, /* cannot in general determine each other uniquely, even if they are 
such as to make both series (69), (70) convergent for every m. (Cf. also (i)- 
(iv), in §38- §39). 

However, the periodic structure of the columns of the matrix (e nm ) makes it 
seem plausible that, under reasonable restrictions of convergence, the Eratos- 
thenian transformation of /'(ft) must generate hidden periodicities in /(ft). 

23 
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These periodicities can beforehand be meant only in the sense, more or less 
vague, in which the term is used by the geophysicist (who, however, has some- 
thing very definite, namely the possibility of an anharmonic analysis, in mind), 
or even only in the sense used by the applied mathematician. But the hidden 
periodicities may correspond to an almost periodic behavior in the technical 
sense of the term. The aim of this and of the next chapter is the development 
of a Fourier theory for the resulting harmonic analysis of the sieve of Eratos- 
thenes. 



26 . For m = 1, 2, • • • and n — 1, 2, • • • , let 

(71) e m (ri) = 0 unless m | n and e m (n) = m if m \ n. 

Thus, if m is fixed, the function e m (ri) of n has the period m, and vanishes except 
for a single n within a period. Since the exceptional n within a period is char- 
acterized by n = 0 (mod m), and since the product e m (n)ei(n) has the least 
common multiple, {m, Z} , of m and l as period, it is clear from (2)-(3) that 

(72) M (e m ei) = ml/{m , 1} = N{e m ei) 

(in fact, the norm N(g) is identical with the mean M(g) if the latter exists and 
g ^ 0). If the subscript is 1, then (71) is reduced to e\ (n) = 1, and therefore 

(72) to 

(73) M(e m ) = 1 = N(e m ). 

It is also clear from (71) that the representation (16) of 

(74) f(n) = Zf'(d) 

d | n 

can be written in the form 

(75) fin) = Z e m (ri)f (to) /to. 

m = 1 

Let fk{n) denote the &-th partial sum of the Eratosthenian series (75); so that 

(76) f k (n) = Z e m (n)f{m)/m, i.e., fk(ri) = Z f'(d). 

m = 1 d | n 

Since (75) has only a finite number of non-vanishing terms for every fixed n, 

(77) f k (n) — »/(n) as k — > (n = 1, 2, • • • ). 

In view of the periodicity of the functions e m (ri) of n, the series (75) suggests 
a Fourier analysis of the function /(n). However, since every f, even f(ri) = n !, 
has a unique/' by means of which it is representable in the form (75), the Fourier 
analysis suggested cannot in general exist. On the other hand, it will be easy 
to find conditions under which the harmonic affinity between (74)-(75) and the 
sequence of the partial sums (76) is so strong that the ordinary limit relation 
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(77) , which is a triviality (for every fixed n), can be replaced by various kinds 
of convergence in the mean, leading to corresponding classes of almost perio- 
dicity. 

These classes will be understood to be defined in terms of their appropriate 
approximability by trigonometric polynomials. For instance, by the almost 
periodicity ( B ) of a function f(n) will be meant the existence of a sequence of 
trigonometric polynomials ti(n), • • • , fc(n), • • • satisfying N(J — tk) — » 0 as 
k — » x , where N ( ) is the norm symbol (2) ; a symbol which is M ( ) in the nota- 
tions of Besicovitch [2]. In this definition, a trigonometric polynomial, t(n), is 
meant to be any finite sum of the form a exp (i\n) + b exp {inn) + • • • , where 
a, b, • • • , A, /u, • • • are independent of n and X, y, • • • are real. 

27. Each of the functions (71) of n is periodic (with a period increasing with 
m). Hence, every e m = e m (n) is a fixed linear combination of a finite number of 
roots of unity. In other words, every e m (n), and therefore every partial sum 
(76) of (75), is a trigonometric polynomial. This readily leads to the following 
sufficient criterion : 

(XIII) If the series 2/'(n)/?i is absolutely convergent , then the function f{n) is 
almost 'periodic {B). 

In fact (73), (75), (76) and (2) imply that 

00 00 

(78) N(J ~ h) = 2 M{e m )\f'(m) \/m = £ \f'(m)\/m^O 

m = k + 1 ?«,=&+ 1 

as k — > x , if 2 | f'(m) \/m < x. Hence the truth of (XIII) is clear from the 
definition of the class ( B ). 

This theorem (and the corresponding expansion theorem (XVI), §33, but 
not the convergence theorem (XVIII), §35) were recently proved with much 
trouble in the particular case of multiplicative functions / (van Kampen and 
Wintner [51]). The complications resulted from the use of an arrangement 
corresponding to the product representation (45) of these particular functions/. 
Such an arrangement, suggested by the whole structure of f{n) in the multi- 
plicative case, does not exist, and cannot therefore become misleading, in the 
general case. 

Since the least common multiple, {m, Z } , of two positive integers cannot 
exceed their product, ml , the assumption in (XIII) is less strict (but, of course, 
the assertion too is less strict) than in the following theorem: 

(XIII bis) If the double series 22 /'(m) f'(l)/{m, 1} is absolutely convergent , 
then the function f{n) is almost periodic (B z ). 

It is understood that the class (B 2 ) results if the requirement N(f — tk) — » 0, 
which at the end of §26 defined the class (B) = (B 1 ), is replaced by 2V (| / — tk | 2 ) 
— > 0. Correspondingly, (XIII bis) follows if (73) in the estimate (78) of 
N(f — fk) is replaced by (72) in the estimate of N(\f — fk | 2 ). In fact, from 
(75) and (76), 

\f(n) -/,(n)| 2 g £ Z 

m=k+ 1 l=k+l 
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Although the assumption of (XIII bis) is not a necessary condition for the 
almost periodicity ( B 2 ) of/(n), the appearance of the double series in (XIII bis) 
is sufficiently motivated by (21). In this connection, cf. also (XI). 

It is clear that similar, though less elegant, criteria follow for any of the 
classes ( B 9 ) belonging to a fixed Holder index, q. If f(n) = 0(1), then the 
assumption of (XIII) is more inclusive than any of the resulting sufficient 
conditions. In fact, if fin) = 0(1), then/(n) is either almost periodic ( B 9 ) for 
every q or is not even almost periodic ( B ) = iB 1 ). 

28. The proof of (XIII) might suggest that the assumption (78) is too drastic, 
if the assertion is that of (XIII). At any rate, one might expect that the as- 
sumptions of (VII), assumptions which, in view of the remark following (VIII), 
are somewhat milder than what is assumed in (XIII), are sufficient not only for 
the existence of M (/) but for the almost periodicity ( B ) of / as well. It is there- 
fore worth proving that such is not the case, not even if the 0(n) of (VII) is 
replaced by the trivial o(n) of (VIII); and that not even the adjunction of the 
assumption /' (n) = 0(1) of (X) is of avail. 

(XIV) The convergence of the series 2 f{n)/n and both estimates f in) = 0(1), 
|/'(1) | + • • • + \f'(n) | = o(n) together are insufficient f or the almost periodicity 
(B) of the function f(n ). 

In order to prove this, let/'(n) be so chosen that (i) unless n is a prime, /'(n) = 
0 and (ii) if p is a prime, f'ip) = 0(1) as p — » oc . Since there are only o(n) 
primes not exceeding n , the o-assumption of (XIV) is satisfied in virtue of (i) 
and (ii). Furthermore, (i) and (ii) imply, of course, the O-assumption of (XIV). 
Hence, it suffices to show that (i), (ii) and the convergence of 2/'(n)/n, that is, 
of 2/'(p)/p, are insufficient for the almost periodicity (B) of fin). But it is 
known that the absolute convergence of 2 f'ip)/ p is a necessary condition for 
the almost periodicity ( B ) of f(n ), if (i) is satisfied and | f'ip) | = 1 (cf. Hart- 
man and Wintner [42], where necessary and sufficient conditions are given). 
Hence, it is sufficient to ascertain that \f'ip) | = 1 and the convergence of 
2f'(p)/p do not imply the absolute convergence of 2/'(p)/p. But this is 
clear from the example /'(p) = ( — 1) J , where p denotes the j-th prime number. 

The aim of (XIV) is to show that (XIII) cannot be improved in a certain 
direction. This does not mean that the convergence of 2/'(n)/n is a necessary 
condition for the almost periodicity ( B ) of fin)] cf. (III). All that follows from 
(VIII)- (XII) is that the almost periodicity (B) of f(n) implies the convergence 
of 2/'(n)/n under certain Tauberian restrictions. In fact, the existence of 
M if) is a necessary condition for the almost periodicity iB) of fin). 

On the other hand, the assumption, 2 | f\n) \/n < x, of (XIII) implies much 
more than what is stated by the wording of (XIII). In fact, it implies (78). 
But (78) states that the particular trigonometric polynomials that are supplied 
by the partial sums, (76), of the Eratosthenian series, (75), of/(n) tend to fin) 
in the mean iB). And it will now be shown that there exist functions fin), of 
essentially arithmetical significance, which are almost periodic iB) but must be 
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approximated by a sequence of trigonometric polynomials having a structure 
substantially distinct from the structure of (76). In other words, the partial 
sums of the Eratosthenian series of these functions /(n) of class ( B ) do not have 
for f(n) the “harmonic affinity” referred to after (77). This implies, of course, 
that the assumption, 2 | f'(n) \/n < oc, of (XIII) is now violated. 

29. If S is a set of positive integers, its characteristic function, fin) = fs(n), 
is defined by placing /(n) = 1 or fin) = 0 according as n is or is not in S. Thus, 
if S(n) = /( 1) + • • • + f(n ), the ratio S(n)/n is the relative frequency of the 
elements of the set S up to n , and the existence of M (/) means that this relative 
frequency tends to a limit, M(J), as n — » cc . The remark made at the end of 
§27 is applicable to every characteristic function. 

If S i denotes the set of the integers that are representable as a sum of l squares, 
then S 4 (n) = n and S 2 (n) = o(n). Hence the characteristic functions of S 4 
and S 2 are almost periodic ( B ), having the trivial Fourier expansions 1 + 0 + 

0 + • • • and 0 + 0 + • • • . It will now be shown that the characteristic func- 
tion of S 3 is almost periodic ( B ). However, its Fourier expansion, instead of 
being of a trivial nature, is to be determined from the explicit arithmetical 
structure of S 3 (cf. §45 below). 

(XV) The characteristic function of the set of those integers representable as a 
sum of three squares is almost periodic ( B ) but such that the partial sums of its 
Eratosthenian series fail to tend to it in the mean ( B ). 

If S denotes the set of those positive integers which cannot be represented 
as a sum of three squares, and if f(n) is the characteristic function of S, then 
the characteristic function to which (XV) refers is the complementary function, 

1 — fin). Hence, it is sufficient to prove the assertions of (XV) for/(w). 

For a fixed positive integer k 7 let T k denote the set of those positive integers 

contained in the progression 4* -1 .7, • • • , 4 fc-1 .(8 j — 1), • • • . According to a 

classical result, first proved by Legendre, the set S is identical with J\ + T 2 + 
• • • . Since T\ , T 2 , • • • are seen to be mutually disjoint, it follows that/(^) = 
gi(n) + g 2 (n) + • • • , where giAn) denotes the characteristic function of T k . 
But it is clear from the definition of T k that the function g k {n) of n is periodic, 
having the period 4 /e_1 (8 j — 1) — 4* -1 (8 [j — 1] — 1) = 2 2fc+1 , and that g k {n) is 
0 except for a single n within a period, for which n it is 1. Hence the func- 
tion g k (n) of n has the mean M(g k ) = 2 _2A:_1 . But this mean is identical with 
the norm N(g k ), since g k (n) ^ 0. It follows therefore from /(n) = gi(n) + 
<72 (w) + • • • that 

(79) N if - ib 6 7m ) =5 i M(ffm) = E 2- 2 ”- 1 -» 0 

\ m=l / m=k + 1 m=k+l 

as k — > 00. Since every g m (n) is periodic, (79) implies that/(n) is almost peri- 
odic ( B ). 

The remaining statement of (XV) is that (79) cannot be replaced by (78); 
cf. the comments made at the end of §28. But the failure of the limit relation 
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(78) is quite obvious from (74)-(76) and from the structure of the functions 
gk{n) corresponding to the decomposition S = Ti + T 2 + • • • . An equivalent, 
though indirect, proof of this failure will follow by comparing (8O3) and (81) 
with (97) below. 

It may be mentioned that a theorem of Landau [57], according to which the 
asymptotic relative frequency (“density”) of the integers contained in exists 
and equals 5/6, is a corollary of (79). In fact, since every g m {n) is periodic, 

(79) implies that M(f) exists and is represented by 

(79 bis) M(f) = £ M(g m ) = £ T 2m ~ 1 = 1/6. 

m= 1 m= 1 

30. Inasmuch as only the periodic character of the mutually disjoint sets 
T i , T 2 , • • • appears to have been used, one might expect that the above proof 
for the almost periodicity ( B ) of fin) could be transcribed so as to hold for the 
characteristic function of any set S having the following property: There exists 
a sequence of positive integers d\ , d 2 , • • • such that an n is or is not in S accord- 
ing as dk | n does or does not hold for at least one k (clearly, it can be assumed 
without loss of generality* that d k | d { only when k = i). If, corresponding to 
a set S, there exists such a sequence {dk ) , let S be called a division set (belonging 
to the sequence {dk}, which can be chosen arbitrarily) . 

Besicovitch [3] constructed a sequence {d k } such that, if fin) denotes the 
characteristic function of the corresponding division set S, then M (/) does not 
exist. Since the existence of M(f) is a necessary condition for the almost 
periodicity ( B ) of any function, it follows that there exist division sets S for 
which f{n) = fs(n) is not almost periodic (. B ). However, something can be 
saved from §29. 

(XV*) If S is a division set, and if f(n) denotes its characteristic function, then 
the existence of M(f) is sufficient (and, of course, necessary ) for the almost perio- 
dicity (B) of f(n). 

The assumption that the set S be a division set is essential for the truth of 
(XV*). In fact, if the set S is unrestricted, it can obviously be so chosen that, 
if f e (ri), f 0 (n) and f(n) respectively denote the characteristic functions of the 
sets consisting of all even, of all odd, and of all, integers contained in S, then 
M(f) does, but M(f e ) does not, exist. Then it is clear from f e (n) + f 0 {n) = 
f{n) that Mif 0 ), and therefore M(f 0 — f e ), does not exist. It follows therefore 
from the obvious identity f e (n) — foin) = ( — 1 Yfin), that the Fourier constant 
Mif ), where fin) = fin) exp i — 2iri\n), fails to exist for X = i. This implies, 
of course, that fin) cannot be almost periodic iB). Nevertheless, fin) is a 
characteristic function for which M(J) exists. 

* On the other hand, the assumption that (dk , di) = 1 unless k = i would involve a 
rather serious loss of generality. In fact, if any two of the integers di , do , • • • are co- 
prime, it is clear that the set S belonging to the sequence di , d 2 , • • • is multiplicative (that 
is, that the characteristic function of S is a multiplicative function in the sense of §13), 
and is, as a matter of fact, a multiplicative set of particular structure. As to arbitrary 
multiplicative sets, cf. §52 below. 
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In order to prove (XV*), let R k denote the set containing an n if and only if 
(U | n holds for at least one i not exceeding k. Then R k is a subset of R k + 1 , 
and R k — > S as k — > oo . In other words, if h k (n) denotes the characteristic func- 
tion of R k , then h k (n) ^ h k+1 (n) and h k (n) -+f(n) as k — > oo hold for every fixed 
n) so that h k (n) ^ f(n). It is also seen that the function h k (n) of n is periodic; 
in fact, a Euclidean algorithm shows that the least common multiple of di , 
• • • , d k is a period. In particular, the mean M (h k ) exists for every k. Further- 
more, M(hi) ^ M(ho) ^ ^ 1, since h\(n) ^ hi(ri) ^ ^ 1 for every n. 

Hence, M(h k ) tends to a limit as k — » oo . According to Davenport and Erdos 
[12], the latter limit has the value 

lim inf - 23/( m )> even if M(f) = lim - 23 f(m ) 



n—* oo 71 m= 1 



n —* oo 71 m= 1 



does not exist (as it does not in the example of Besicovitch, referred to before). 
In particular, if M (/) exists, then the limit of M(h k ) as k — » oo must be M (/) . 
Since this can be written in the form M(f — h k ) 0, it follows from h k (n) ^ 
f(n) that M{\ f — h k |) — » 0. Hence, (XV*) is implied by the periodicity of the 
functions h k (n). 

The Transposed Matrix and Fourier Series 

31. The preceding chapter is only preparatory to the Fourier theory announced 
at the end of §25. This theory will now be developed. 

The Fourier analysis will automatically lead to the transposed matrix, (68), 
of the Eratosthenian matrix. In fact, the Fourier constant of f(n) will appear 
in the form (69), if the operator * is applied to the function f'{n)/n. However, 
the occurrence of the Mobius function in the inversion, (70), of (69) will induce 
phenomena of divergence corresponding to those in the theory of ordinary 
Fourier series. 



32. It is clear from (71) that the matrix elements of the Eratosthenian trans- 
formation (74)- (75) can be written in the form 



( 80 i) 

and so 



nW = 23 exp (2?r inj/77l) = \ 



[m if 71 = m, 2m, 



3=1 



[0 unless m | n , 



[v if k ^ a, 

(8O2) ei(n) = 1 and e v k(n) = { 71 = p a q ••• , 

[0 if k > a, 

where a , b , • • • are non-negative integers and p, q , • • • denote distinct primes. 

Let c m {n) be defined as the expression obtained by retaining in the sum (8O1) 
only those terms, p(m) in number, for which j is relatively prime to m. Thus 



(8O3) 



n {ri) = 23 ; exp(27rmZ/ra), where 1 ^ l ^ m and (l, m) = 1. 
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Then e m (n) = c d {n), by (80i). Hence, from (15), 

d | m 

(80 4 ) /'(m) — Cm(n) if f(m) = e m (n ) for fixed n. 

Consequently, from (18) and (17), 

(805) c\(ri) = 1 and c p k(n) = e p k(n ) — e p k-i(n), ( k = 1, 2, •••), 

since e\ (n) = 1, by (8O1). Also 

(80 6 ) c m { 1) = g(ra) 

in virtue of the definition (22). Furthermore, from (8O5) and (8O2), 

(80 7 ) c p (n) = — 1 unless p | n and c p (n ) = p — 1 if p | n. 

It is clear from (71) that, if n is fixed, e m (n) is a multiplicative function of m. 
It follows therefore from (80 4 ) and (39) that the same is true of c m (n). This 
proves without any calculation the principal formal property of the so-called 
Ramanujan sums, (80 3 ). 

Even the “explicit” representation 

(80s) c m (n)/(t>(m) = , n))/(j)(m/(m , n )), 

pointed out by 0. Holder, follows with a minimum of labor, if it is observed 
that, in virtue of (80 4 ), (71) and (24), 

(80 9 ) c m (n) = id(m/d)d, 

d | (m,n) 

where (m, n) denotes the greatest common divisor of m and n. 

33. If (8O1) is combined with the Eratosthenian identity (75), it is easy to 
show that the assertion of (XIII) can be completed as follows: 

(XVI) If the series 2 f'(ri)/n is absolutely convergent , then the function f in) is 
almost periodic ( B ) and has the Fourier series ( B ) 

00 

(81) fin ) ~ ^ a m c m (n), 

m = 1 

where c m (n) is the Ramanujan sum (80 3 ) and the Fourier coefficients a m are given by 

(82) a m = £ — 5 («! = M(f) = E f'(n)/n). 

rn | n n 

In fact, insertion of (80i) into (76) gives 

fk(n) = E E exp {2irinj / m) . 

m= 1 m 3=1 

Since the absolute convergence of S/'(n)/?2 implies the relation (78), according 
to which f k (n) tends to/(n) in the mean of the space ( B ), it follows, by letting 
k — > oo, that 

(81 bis) f(n) ~ exp(27rmr/s), 

h=l \=rls hS 
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where (r, s) = 1, and X ranges over all rational numbers r/s contained in the 
interval 0 < X ^ 1. Since the definitions (80 3 ), (82) imply that (81 bis) is 
identical with (81), the proof of (XVI) is complete. 

34 . Inasmuch as (75) is an identity, one might expect that the representation 
of f(n) in terms of its Fourier expansion (81)-(82) also is an identical relation. 
But such an expectation is erroneous. 

(XVII) The absolute convergence of 2/'(n)/n, which according to (XVI) implies 
for f(n) the Fourier series (81), is compatible with the divergence of the latter. 

In fact, it is clear from (80e) and (82) that the fc-th partial sum of the Fourier 
series (81) at n = 1 is 

k 

(830) Sk = fl(nT) t n , 

772=1 m I n 

if t n is an abbreviation ior f'(ri)/n. Hence, in order to prove (XVII), it suffices 
to show the existence of sequences {t n } for which the series 2 t n is absolutely 
convergent but the corresponding sequence {$&}, defined by (83 0 ), is divergent. 
Consequently, it is more than sufficient to show that 2 | t n | < is compatible 
with Sk 5* 0(1) as k — > oo . But (83 0 ) can be written in the form 

oo d ^ 7 1 

(831) Sn = ^ ^ Oi nm tm ) (832) OLnm = ^ 1 m(^) 

771=1 d | 771 

(if k, m , n in (83o) are replaced by n, d, m respectively). Furthermore, it is 
clear from the Lebesgue-Toeplitz construction that, if a linear transformation 
(83i) violates the norm condition 

lim sup | CX-nra | < 00 , 



then it certainly cannot transform every sequence {t n } satisfying 2 | t n | < °° 
into a sequence {s n } satisfying s n = 0(1) as n — > oo. Consequently, it is suffi- 
cient to show that the requirement expressed by the last formula line is violated, 
if the matrix ( a nm ) is defined by (832). In other words, it is sufficient to ascer- 
tain the existence of a function pair n = n(k),m = m(k) satisfying 



(84) n(k) 



m(k) 



oo and 



d^n{k) 

m(^) 

d | m(k) 



oo as k 



But such a function pair can readily be constructed by using products of lacunary 
sequences of primes. 

The construction necessarily leads to a steep “curve” n = n(k), m = m{k) in 
the (n, m)-“plane”, since, according to (83 2 ), 



(85) | a nm | ^ n and | a nm | ^ d(m ), 

if d{m) denotes the number of the divisors of m (in particular, every column as 
well as every row of the matrix (< a nm ) consists of a bounded sequence of numbers). 

The expressions considered above were the partial sums (83 0 ) of the Fourier 
series (81) at n = 1. It is clear from (80s) that the construction could be carried 



32 



ERATOSTHENIAN AVERAGES 



out for every fixed value of n in the Fourier series (81). In view of the diagonal 
principle of Cantor, it might even be possible to construct an example satisfying 
the assumption of (XVI) and rendering the Fourier series (81) divergent for 
all n. 

If m ^ n, then d \ m implies that d ^ n, and so it is clear from (83 2 ) and (24 
bis) that 

(85 bis) ctni = 1 and a nm = 0 for 2 ^ m ^ n. 

But if n < m, the value of the sum (83 2 ) is not under explicit control. Thus 
the fact that the full force of the assumption 2 | t n | < 00 does not appear to 
have been used in the reduction of s n ^ 0(1) to (84), is suggested by an observa- 
tion of Hadamard [33] concerning the converse of the Dirichlet-Dedekind criteria 
for convergent partial summation. 



35 . In view of (XVI) and (XVII), the situation is comparable to that in the 
theory of ordinary Fourier series of functions f(t) on the interval 0 ^ t < 2 t 
(the common assumption, 2 | /'(ft) \/n < oc, of (XVI) and (XVII) corresponds, 
besides to L-integrability, to continuity or integrated continuity near every t). 
Correspondingly, there arises the need for a criterion similar to the convergence 
theorem of Dirichlet-Jordan, for instance. It is indicated by (85) that a suffi- 
cient criterion of this type will somehow involve the function d(m) of m. Actu- 
ally, a simple (and, as will be seen in §36 and §46— §54, quite useful) analogue 
of the Dirichlet-Jordan theorem turns out to be the following: 

(XVIII) If the series 2 d(n)f'(n)/n is absolutely convergent , then not only (81) 
holds for (82) but also 

00 

(86) f(n) = 2 a m c m (n), (n = 1,2, ■ ■ ■ ). 

m= 1 



Furthermore , although f(n) ^ 0(1) in general , the convergence of the Fourier series 
(86) is absolute (but, if f(n) 0(1), certainly not uniform) for all n. 

The assumption of (XVIII) is more severe than that of (XVI)-(XVII), but 
only slightly so, since 

1 g d{n) ^ 0(log 1/e n) if e > 0, but din) = 0(« 1/lo * logB ). 



It is clear from (8O9) that | c m (n) | ^ a ((m, n)), if a(k) is the sum of the divi- 
sors of k. Hence, if a n denotes the maximum of <j(k) for 1 ^ k ^ n, then 
| c m (n) | ^ (T n . Since a n is independent of m, it follows from (82) that the series 
(86) belonging to an arbitrary n is absolutely convergent if 



(87) 



771=1 



m \ n 



Hf'(n)/n lEI I /'(»)/« | < 



771=1 771 | 71 



Since the range of the double summation contains a fixed positive integer, say k, 
exactly d(k) times, the last double series can be contracted into 2 d(k)\f'(k) \/k. 
This proves that part of (XVIII) asserting the absolute convergence of the 
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series (86). That the sum of the series (86) is precisely /(n), is clear from the 
proof of (XVI), since the convergence of the double series on the right of (87) 
ensures the legitimacy of the formal rearrangement of (75) into (86). 

36. Since the sum (80 3 ) is a periodic, and therefore bounded, function of n 
for every m, the parenthetical remark of (XVIII) is obvious. Hence, in order 
to complete the proof of (XVIII), it is sufficient to exhibit functions /(n) satis- 
fying the assumption of (XVIII) but violating f(n) = 0(1). 

An interesting function having these properties is/(n) = co(n) — v(n), where 
u(n) denotes the number of all, and v(ri) the number of all distinct, prime divisors 
of n. In other words, co(n) is the sum of the exponents a, (3, • • • in the factoriza- 
tion p a (f • • • of n into powers of v(n) distinct primes p, q, • • • , where it is under- 
stood that co (1 ) = 0 and v(l) = 0. Thus it is seen from (15) that the function 
v'{n) belonging to v(n) is 1 or 0 according as n is or is not a prime. Corre- 
spondingly, co'(n) is 1 or 0 according as n is or is not a prime power (which can 
be a prime). 

Since the connection (15) between the functions f(n),f'(n) is distributive, it 
follows by subtraction that, if f(ri) = co(n) — v(n), 

f'(n) = 1 if n = p 2 , p 3 , • • • and f'(n ) =0 if n 9 * p , p\ • • • , 

where p denotes any prime number. Inasmuch as the number, d(n), of aU 
divisors of n is k + 1 when n = p k , it follows that 

E d(n) \f'(n ) | /« = iE(H l)/v k = E (fc + 1) 0(2"*) < . 

n = 1 k=2 p A— 2 

Hence, the assumption of (XVIII) is satisfied by f(n) = c o(n) — v(ri), although 
f(n) 9 ^ 0(1). 

It also follows that the series (82) is the sum of the reciprocal values of those 
prime powers (distinct from p° = 1) which are multiples of m but are not primes. 
This implies that the Fourier constant a m vanishes unless m = 1, p, p 2 , • • • , 
and that ai = S/'(n)/n = M(f) has the value 2 ( p 2 — p) -1 , where p ranges over 
all primes. 

37. Since two trigonometric polynomials (80 3 ) belonging to two distinct values 
of m have no frequency in common, it is clear that, if a series (86), where ai , 
a 2 , • • • are arbitrary constants, is uniformly convergent for all n , and if f(n) 
is the function defined by (86), then f(n) is uniformly almost periodic (that is, 
almost periodic in the sense of Bohr) and has the Fourier series (81). 

However, it does not follow that the Fourier constant a m must then be repre- 
sentable in the form (82). Furthermore, an adaptation of standard construc- 
tions leads to a uniformly almost periodic function f(n) having a Fourier series 
of the form (81) which, however, fails to converge uniformly. Finally, the uni- 
formly almost periodic function f(n) = exp(27rm/3), being identical with its 
Fourier series, has a Fourier series not representable in the form (81); in fact, 
the sum (80 3 ) belonging to m = 3 consists of <p(3) > 1 terms. 
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The transcription of the proof, (78), of (XIII) to the case of uniform almost 
periodicity leads only to the sufficient criterion represented by 



(88) 



E I /'(») I < °° • 



In fact, since the function (71) of ft is periodic and has an absolute value not 
exceeding m, the Eratosthenian series (75) is a uniformly convergent series of 
periodic functions, if (88) is satisfied. 

Clearly, (88) entails the assumption of (XVIII). Actually, the Fourier series 
(86) of /(ft) is absolutely-uniformly convergent, if (88) is satisfied. 

In fact, since the sum (80 3 ) consists of <p(m ) terms of absolute value 1, it is 
clear from (82) that the series (86) is absolutely-uniformly convergent whenever 



(87*) 



E 4>{m) 



E /'(»)/» 

m | n 



^ EE 4>( m ) \f'(n)/n\ < 



771=1 771 1 71 



But the range of the double summation contains a fixed integer, say k, exactly 
d(k) times, and the corresponding f'(n)/n each time is multiplied by <p(d), where 
d runs through the d(k) divisors of k. Hence, the requirement expressed by the 
convergence of the double series (on the right) is identical with 



(88 bis) ^2 | f'{k)/k | ^2 4>{d) < <*> or, since ^2 4>{d) = k , 

k=l d | k d | k 

with the assumption (88). This proves the assertion. 

It does not follow that the assumption (88) implies for the Fourier series (86) 
a convergent majorant 2 a m , where a m (> 0) is independent of ft. Nevertheless, 
it is clear from the preceding remarks that condition (88) as a sufficient criterion 
for the uniform almost periodicity of /(ft) is of a trivial nature. On the other 
hand, if (88) is replaced by the assumption that 2 /'(ft) is a convergent series 
(an assumption under which (29 3 ) has been established and which therefore is 
more than sufficient for the existence of M(f) and for the convergence of 
2/'(n)/n), then /(ft) need not be uniformly almost periodic; not even if /(ft) is 
assumed to be almost periodic ( B ). In fact, if /(ft) = <t>(n)/n, then /(ft) is al- 
most periodic ( B ) but is not uniformly almost periodic (cf. §48 and §49 below), 
although 2 /'(ft) is convergent, since (15) holds for /(ft) = 4>(n)/n if /'(ft) = 
^(ft)/ft (the convergence of 2 n(ri)/n is assured by the prime number theorem). 
Conversely, the convergence of 2 /'(ft) is easily seen to be unnecessary for the 
uniform almost periodicity of /(ft). 

All of this shows that a necessary and sufficient condition characterizing those 
functions /'(ft) for which the function /(ft) defined by (15) becomes uniformly 
almost periodic must involve arithmetical criteria of a high degree of delicacy. 
But the existence of a reasonably explicit condition of this type is problematic ; 
the more so as, from the point of view of the standard arithmetical functions, 
the notion of uniform almost periodicity appears to be a somewhat artificial 
construction* (cf. §49— §51 below). 



* It is interesting that in this regard the situation is the same as in the applications of 
the notion of uniform almost periodicity to general dynamical problems. Cf. A. Wintner, 
Proc. Nat. Acad. Sci., vol. 27 (1941), pp. 311-314; N. Wiener and A. Wintner, Am^. Journ. 
of Math., vol. 63 (1941), pp. 794-824; P. Hartman and A. Wintner, ibid., vol. 65 (1943). 
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The Sieve of Eratosthenes and Harmonic Analysis 

38. The connection between the assumptions and assertions of (XIV)- 
(XVIII), on the one hand, and the fact that (75) is an identity for every/, on 
the other hand, becomes revealing if the details of the formal rearrangement, 
mentioned at the end of §35, are followed in case of a few functions of arithmeti- 
cal interest. 

(i) Let either /(n) = v(n) or/(n) = co(n), where v{n ), c o(n) are the functions 
defined at the beginning of §36. Both functions (32) are of the order of log log n 
in either case, since 

23 - ~ log log 71, EE \ < 00 , -23 v ( m ) ~ l°g log n 

V<n P p k= 2 P 71 m= 1 

and M(co — v) = 2 (p 2 — p)~\ by the end of §36. In particular, neither M(f) 
exists. However, since v(n) is the number of all primes p satisfying p | n, 
and since (80 7 ) states that 1 + c p (n) is p or 0 according as p does or does not 
satisfy p | n, it is clear that 

v{n) = E -- + -- p(n) 

P P 

is an identity inn. This infinite series has only a finite number of non-vanishing 
terms for every fixed n. It can be thought of as representing the series (86) 
in the case/ = v; in the sense that, in accordance with the fact that both func- 
tions (32) belonging to/ = v are asymptotically equal to log log n, the coefficient 
a\ = M(f) turns out to be 2 p -1 = <x> . If m > 1, the identification of the last 
formula line with (86) gives a m = 0 unless m = p, and a v = p~ l for every p. 
And all of this is precisely what is assigned by (82), even if tti = 1 is not excluded. 
In fact, it was seen in §36 that v'(n) = 0 unless n = p and that v r (p) = 1 for 
every p. 

(ii) It is clear from the results of §36 that the preceding remarks remain valid 
if the number, v(n), of all distinct prime divisors of n is replaced by the number, 
co ( 71 ), of all prime divisors of n. On the other hand, the situation becomes almost 
the opposite if either of these functions f(n) is replaced by f(n) = d(n), the 
number of all divisors of n. In fact, Ramanujan ([69], p. 186) has shown (on 
the basis of the prime number theorem) that the trigonometric series (86) having 
the coefficient a m = — tti -1 log m for m = 1, 2, • • • converges, and has the sum 
d(n), for every n. Since d'(n) = 1 for every n in virtue of (15), and since 
d(l) + • • • + d(n) ~ n log n, both functions (32) belonging to f(n)= d(n) are 
of the order of log n. In particular, M (J) must be identified with co . But 
a m = —m~ l log tti is 0 when m = 1, and so the identification ai = M(f), which 
for / = v gave 00 = 00 , now gives 0 = 00 . Finally, since/' (n) = 1 for every n, 
the series (82) gives — tr - 1 log m = 00 for m = 1, 2, • • • . Nevertheless, the 
series (86), which for f = v was a trigonometric series only in a metaphorical 
sense, is now bona fide trigonometric. 

(iii) It may be shown that neither of the series just mentioned is a Fourier 
series in any sense (for instance, in the sense to be used in (*), §55 below). A 
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function with a Fourier series ( B ) will now be considered. First, the prime 
number theorem (for the arithmetic progressions) is known to imply that each 
of the Kluyver series 2 n(n)/n , 2 n(2 n)/n, • • • , 2 ix(mri)/n , • • • is convergent 
and has the sum 0, which means that each of the series (82) converges to 0 when 
f'(n) = n(n ). According to (22), the corresponding f(n) is 0 unless n = 1, 
which implies that /(n) is almost periodic ( B ), with fin) ~ 0 + 0 + • • • as Fourier 
series. Consequently, the assertions of (XVI) are true, although the assump- 
tion, 2 \ n(n) | /n < 20 , of (XVI) is violated. Furthermore, (86) becomes 

1 = 0 + 0 + • • • when n = 1 but is correct for every n > 1. . 

(iv) A dual of the example (iii) results if a m is assigned to be m -1 for every 
m and the function /' is then defined by the formal inversion of (82). The 
formal inversion of (69) being (70), the value of f'(m) is 2 g(n)(mn) -J , i.e., 
m~ l 2 ju(n)/n, which, by the prime number theorem, is 0 for every m. Corre- 
spondingly, (80e) shows that the sum of the series (86) belonging to a m = mT 1 
is 0 when n = 1. According to Ramanujan ([69], p. 185), this implies that the 
sum of the series (86) belonging to a m = m~ l is 0 for every n. Since /'(m) = 0 
for every m, it follows from (15) that (86) is true for every n. 

39. These examples suggest the following observations: 

( 1 ) Although f(n) was almost periodic ( B ) in (iii), the convergence of the 
series (82) depended on the prime number theorem. Correspondingly, the 
series (82) need not converge whenever /(n) is almost periodic ( B ). Examples 
to this effect can be constructed by observing that, if e(n) denotes the simple 
vibration e lXn belonging to a fixed real X, then e\n ) = ^ fi{n/d)e Xd , by (24). 

d | n 

( 2 ) There exist sequences ai , a 2 , • • • for which each of the series 2 a v , 

2 a 2w , • • • , 2 a mn , • • • is convergent and has the sum 1 (for a class of simple 
examples, derived by ordinary harmonic analysis, cf. Rajchman [68]). Hence, 
if f'W = na n , then each of the series (82) is convergent and ai = a 2 = • • • =1 
holds for the corresponding / ( n ). This means that the formal rearrangements 
referred to at the beginning of §38 can lead, instead of to (81)-(82), to (82) and 
to the trigonometric series 2 c m (n); a series which corresponds to J + 2 cos mt 
in the analogy pointed out before (XVIII). 

(3) Let ft , /3 2 , • • • be a sequence having the same property as, but distinct 
from, the sequence a\ , a 2 , • • • used before. Then each of the series (82) 
belonging to f'(n) = n(a n — ft) converges and has the sum 0, as in (iv). How- 
ever, the situation is not the same as in (iv). In fact, since a n ^ ft for at least 
one 7i, it follows from (15) that the function/(n) does not, although the coefficient 
a m of the series (86) does, vanish identically; so that the relation (86), which in 
(iv) was true (for every n), is now false (for some n). 

(4) The preceding example and (iv) imply that Cantor's uniqueness theorem 
can in no sense be paralleled in the analogy referred to before (XVIII). 

(5) In view of the problem of Fourier constants of periodic functions of class 
(L), there cannot be expected a direct characterization of sequences ai , a 2 , • • • 
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for which the trigonometric series 2 a m c m (ft) is the Fourier series of some /(ft) 
of class ( B ). On the other hand, since (8O3) consists of <t>(m) terms, it is clear 
from the analogue of the Fischer-Riesz theorem that 2 a m c m (n) is the Fourier 
series of some /(ft) of class ( B 2 ) if and only if 2 | a m | 2 < 00. Cf., how- 

ever, (XV) and the remark made at the end of §27. 

40. For an arbitrary function /(ft), let 

(89) /x(ft) = fin) exp ( — 2 iri\n), 

where X is a real number. Since X can be reduced mod 1, it will always be 
assumed that either 0<X^lor0^X<l. If the mean, (3), of (89) exists for 
a fixed X, the value M(f x ) is called the X-th amplitude of the function /(ft). 
In particular, the 0-th amplitude is M(f), if M{f) exists. 

The function /(ft) is said to have an amplitude function, M(f x ), 0 ^ X < 1, 
if its X-th amplitude exists for every X. In this and only in this case has /(ft) 
a Fourier series 

(90) /(ft) ~ M(f Xk ) exp (27 rt'Xjfeft), 

k 

if (90) is defined to be just an abbreviation for the following pair of statements: 

(i) The mean M{f x ) of (89) exists for every X and 
in) the value of M(f x ) is 0 unless X is a X* (mod 1). 

If /(ft) is almost periodic (B) and (90) is its Fourier series (B), then the latter 
is the Fourier series in the sense {i)-{ii) also. The converse is true provided 
that /(ft) is supposed to be almost periodic (B). The latter proviso cannot be 
omitted. In other words, the existence of an amplitude function is necessary 
but not sufficient for the almost periodicity (B) of /(ft). 

For instance, let /(ft) be the characteristic function of a set S (as defined at the 
beginning of §29), and suppose that /(ft) is almost periodic (B). Then it is 
almost periodic (B 2 ), since /(ft) = 0(1). Consequently, by Parseval's relation. 

(91) M(/)-M(/) 2 = Z \M(fd | 2 , 

0<X<1 

since f = f 2 and M (/ 0 ) = M (/) . But the harmonic law of large numbers implies 
that if a set S of positive integers is chosen “at random”, then its characteristic 
function, /(ft),, possesses an amplitude function, M(f x ), which vanishes for 
0 < X < 1 but is J for X = 0 (Wiener and Wintner [86]). And such an /(ft) 
cannot be almost periodic (B), since it reduces (91) to \ — (J) 2 = 0. 

It should be mentioned that, k being an integer, the notation (90) involves a 
tacit assumption. No such assumption is needed if /(ft) is either almost periodic 
(B) or such that g = f 2 has a finite norm (2). In fact, it is clear in the first case, 
and it follows from an adaptation of Bessel's inequality in the second case, that 
M(f x ) = 0 for every X not contained in an enumerable set. But if fin) is not 
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almost periodic ( B ) and the assumption N(f) < oo is lightened to N(f) < oo 
(or, for that matter, to N(f~ e ) < oo ), it has never been proved or disproved that 
the amplitude function must vanish on the complement of an enumerable X-set. 
On the other hand, not even N(f) < oo is assumed in the definition (i)-(w) 
following (90). Cf. Wintner [91]. 

41 . If an /(ft) is such that, for a fixed Z, the function 

(92) f l \n) =f(n)f(n + l) 

of ft possesses a mean, ilf(/ (0 ), the latter is called the Z-th auto-correlation 
coefficient of /(ft). In particular, if M ( | / | 2 ) exists, it is the 0-th coefficient of 
auto-correlation, provided that Z = 0 is allowed in the preceding definition. 
In view of (130), §59 and (**), §60 below, it will be convenient to assume that Z 
denotes a positive integer. This agreement deviates from the usual terminology, 
according to which /(ft) is said to possess an auto-correlation function if M(f {l) ) 
exists for every non-negative integer l ; a terminology which alone makes possible 
a spectral characterization of the class (. B 2 ) of almost periodic functions (cf. 
Wiener and Wintner, [87], Theorem 3). 

If /(ft) has an amplitude function, that is, a Fourier series in the sense defined 
in §40, then substitution of (90) into the product (92) gives 

(93) M{f W ) = Z I M{h k ) | 2 exp (2 Tt\ k l), 

k 

provided that the operation (3) is applied on (92) in a formal way. Straight- 
forward examples show that the latter proviso is not always legitimate. 

42 . The definitions of §41 will be used in §50 and §59— §60. For the present, 
only the notations of §40 will be needed. 

Since the Lambertian formulation, (59), of the definition, (15), of /'(ft) is an 
identity in the positive parameter r (in case of convergence), it can be replaced 
by the corresponding identity in a complex variable, z; that is, by 

(94) 2/(ft)z n = F(z), where F(z) = 2 f'(n)z n /(l — z n )- 

Correspondingly, the results derived from (59) for M ( / ) can be transcribed to the 
case of an arbitrary X in (89). The simplest fact in this direction may be 
formulated as follows: 

(*) If /(ft) has an amplitude function, and if a = a(\) is an abbreviation for 
the function M(/_x), where 0 ^ X < 1, then (94) is convergent for | z | < 1, 
and a(\) has the property that 

(95) (1 - r)F(z ) -> a (A) as 2 -> e~ 2 " x , (| 2 | = r < 1), 

where arg z = —2ir\ is fixed or, more generally, z approaches the circle | z | = 1 
within a Stolz wedge. If (94) is convergent for | z | < 1, and if /(ft) = 0 L ( 1) 
as ft — > oo ? then/(ft) possesses an amplitude function, M(f\) = a( — X), whenever 
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there exists on the range 0 ^ X < 1 a function a = a(\) satisfying (95) for every 
fixed X in case of radial approach. 

For every fixed X, the first part of the assertion (*) is only a restatement of the 
fact that the summability (C, 1) of a series suffices for the existence of the 
Abelian, and even of the Stolzian, limit of the power series generated by the 
series (the series to which this fact is to be applied is the one having (89) as its 
n - th partial sum). Correspondingly, the second part of the assertion (*) 
follows, just as in the proof of (IX 2 ), from the elementary Tauberian theorem of 
Hardy and Littlewood (their theorem is to be applied to the two series having 
the real part of f(n) + f\(n) and the imaginary part of if in) + f\(n) as their 
n-th partial sums, if / ^ 0; and/(n) = 0 L ( 1) is, of course, reducible to / ^ 0). 

43 . The first (that is, the purely Abelian) part of (*) leads to a short proof of 
an essential refinement of the so-called Knopp-Landau theorem in the theory of 
Lambert series. 

In order to see this, let the coefficients of the Lambert series (94) be restricted 
by the condition 2 \f'(n) | /n < «>. After it has repeatedly been observed 
that, if X is rational, say X = l/m, where 1 ^ l ^ m and ( l , m) = 1, then the 
limit a(\) occurring in (95) must exist, and possess the value (82) belonging to 
the index m thus defined, Landau [59] has shown by a somewhat lengthy (since 
Diophantine) proof that (95) is satisfied by a(X) = 0, if X is irrational; cf. Spath 
[71]. However, it is clear from (80 3 ) and from the Abelian character of the proof 
of the trivial part of (*) in §42, that a much sharper result is contained in that 
corollary of (XVI) according to which (81)-(82) holds if (81) is meant in the 
sense of §40. In other words, the full force of (XVI) is not used, since the almost 
periodicity ( B ) of (81) is not needed, and what results is nevertheless a Tau- 
berian refinement of the Knopp-Landau theorem, although no Tauberian ele- 
ment, but only the Abelian lemma of Frobenius, occurs in this deduction. 

44 . If a function f(n) is almost periodic ( B ), its Fourier series need not be 
expressible by means of the Ramanujan sums (80 3 ) in the form (81). In fact, 
if {X*} is any real sequence, and if the k - th coefficient in (90) is chosen to be , 
where {/ 3k } is any sequence satisfying 2 | /3 k | < <x>, then the series (90) con- 
verges to a uniformly almost periodic function having the series (90) as its 
Fourier series. In the general case, every X& is rational if, but not only if, (90) 
can be contracted so as to have the form (81). In other words, (81) and (80 3 ) 
require that the <£(ra) values of k, which in virtue of X& = l/m belong to a common 
m, should correspond to Fourier constants which are independent of l , where 
(1, m) = 1 and 1 ^ l ^ m. According to the proof of (XVI), this condition is 
satisfied if the sequence of the partial sums of the Eratosthenian series (75) 
of/(n) tends to/(n) in the mean (B). 

As an illustration of the last remark, it is instructive to verify the second of the 
ssertions of (XV) by showing that the Fourier series (B) of the function, /(n), 
defined in the proof of (XV) is not of the form (81). 
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First, if g(n) is any periodic function, and if r denotes its period, then trigono- 
metric interpolation supplies the Fourier expansion 

r r 

(96) g(n) = 22 <*; exp (2 mnj/r); a ,■ = r~ l 22 g(n) exp ( — 2mjn/r). 

7=1 n= 1 

Next, if g(ri) is 0 for r — 1 of the r values of n contained in a period, and if 
g(s) = 1, where s denotes the exceptional n between n = 1 and n = r, then (96) 
shows that aj = r~ l exp( — 2irijs/r ), hence 

r 

(97) g{n) = r~ l 22 exp (2 iri[n — s]j/r). 

7 = 1 

It now suffices to apply (97) to each of the functions g = gi , g 2 , • • • defined 
in §29, and to observe that, in virtue of (79), the Fourier series ( B ) of/(n) can be 
obtained by formal addition of the Fourier series, (96), of g = gi , < 72 , • • • . 

45. As another illustration, suppose that (90) holds for a sequence of ra- 
tional exponents \k , and replace, for every fixed m, the simple vibrations 
exp(27rmZ/m), where (Z, m) = 1 and 1 ^ l ^ m, by their <t>(m) superpositions 

m 

(98) g l m{ri) = 'Zj exp (2 iri[h 2 l + hn]/m) } 

h= 1 

[occurring in Cauchy’s proof ([8]; cf. Lerch [60]) for the reciprocity law of the 
Gaussian sums (that is, of the sums (98) belonging to n = m), and differing 
from the sums S pqv of Kloosterman [55] only in notation]. 

If (98) is introduced into (90), where l = h ,m = nik and A& = l/m by assump- 
tion, then the Fourier series appears in the form 

(99) /(n) ~ aigfi(n), 

71 = 1 l 

where the coefficients a represent the Fourier constants and the accent of the 
inner summation sign indicates the same Z-range as in (80 3 ). [Corresponding to 
the remark following (98), the Hardy-Littlewood solution of Waring’s problem 
depends on Fourier series of a type generalizing (99), the polynomial in h beneath 
the exponential sign in (98) being of degree r in the problem of r-th powers. In- 
cidentally, it is a trivial consequence of the results of Hardy and Littlewood 
that their trigonometric series obtained actually are Fourier series.] 

In the case (99), it is seen from (98) and (80 3 ) that what parallels Fourier 
series of the Ramanujan type (81) are those series in which each but the first of 
the 4>(m) Fourier coefficients a m belonging to a fixed m happens to be 0 for every 
m. In fact, (99), (98) are then reduced to 

00 m 

(100) /(n) ~ oc m g m (n), where g m (n) = 22 exp (2 Ti[h 2 + hn]/m). 

771=1 h= 1 

However, (100) involves for f(ri) restrictions essentially different from those 
imposed on f(n) by (81), (80 3 ). Actually, the formal assumption of (100) 
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replaces the Eratosthenian approximations (76) leading to (81), (80 3 ) by assump- 
tions substantially expressible in terms of the modul group (cf. Petersson [66]). 
I expect to return to the resulting questions of almost periodic behavior. 

Multiplicative Functions 

46. Let f(n) be a multiplicative function. This means, by (39), that f'(n) 
is multiplicative. Hence, | f'(n) \ /n is a multiplicative function, and so (38), 
(40) and (51) imply that 2 |/'(n) | /n < co if and only if 

(101) Z Z I Kv) - Kv*- 1 ) W < » ; /(I) = 1. 

p k= 1 

Since the number, d(n ), of the divisors of n is the multiplicative function for 
which d(p k ) = k + 1, it is similarly seen that 2 d(n) | f'(ri) | /n < °o if 

(102) EDI Kv k ) - f(P k ~ l ) \/v k < 00 ; /(i) = i. 

p k= 1 

Finally, 2 \f'(n) \ < oo is equivalent to 

(103) Z Z I Kv) - /(p*- 1 ) I < « ; /(l) = 1. 

p k=l 

Consequently, (XVI), (XVIII) and the criterion (88) of §37 respectively imply 
that 

(i) if (101) is satisfied, then/(n) is almost periodic ( B ) and 

00 

(104) f(n) ~ Z a m c m (n), a m = Z f'{n)/n, 

m= 1 m | n 

where the series representing cq = M(f), and therefore the series representing 
any of the Fourier constants, a m , is absolutely convergent in virtue of (101); 

(ii) if (101) is replaced by the more stringent assumption (102), then the 
Fourier series (104) is absolutely convergent, and has the sum/(n), for every n; 

(iii) if the drastic condition (103) is satisfied, then the Fourier series (104) is 
absolutely-uniformly convergent and has the sum f(n ), which implies in par- 
ticular the uniform almost periodicity of the function f(n). 

47. As an illustration, consider the sum, v a (n), of the a-th powers of all 
divisors, d , of n, where d a is defined in terms of the real logarithm of d. Thus, 
if a is arbitrarily fixed, 

(105) f{ V k ) = Z V ~ ai if f(n) = «r„(n)/n“. 

2=0 

Since a a (n), hence f(n), is multiplicative, it follows from (ii) that <j a (n)/n a is 
almost periodic ( B ), and is represented by its Fourier series, whenever the sums 
(105) satisfy (102), that is, whenever /3 > 0, where /3 denotes the real part of a. 
Since (37), (39) and (105) imply that 

X/'(™)/™ = X n~ a /n = ^ n~ a ~ l = m~ a ~ l $(pL + 1), 
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it follows from (104) that 

00 

(106) <r«(n)/n“ = f(a + 1) Z m~ a ~ l c m {n) if 0 > 0, 

m— 1 

and that the trigonometric series (106) is the Fourier series ( B ) of the function 
which it represents. 

The identity (106) was discovered by Ramanujan ([69], p. 185) who used, of 
course, a different approach but made precisely the above assumption, p > 0. 
If p > 1, then, since (80 3 ) implies that | c m (n) | ^ m, it is implied by Ramanu- 
jan’s identity (106) that a a (n)/n a is uniformly almost periodic and has the series 

(106) as its Fourier series (incidentally, this, but nothing more than this, follows 
from (iii), §46). But the trigonometric series (106) turns out to be the Fourier 
series (B) of the function represented by it also when 0 < p ^ 1 ; a fact which, 
in view of §38, is by no means implied by the truth of (106). That /(ft) cannot 
be uniformly almost periodic, and therefore the series (106) cannot be uniformly 
convergent, in the case 0 < p ^ 1 , is clear from the fact that, the function defined 
by (105) being multiplicative, not even /(ft) = 0(1) is true when 0 < P ^ 1. 

48 . The situation is similar if <r a (ft) in (105) is replaced by </> a (ft), where <t> a {n) 
is, for a fixed positive interger a , Jordan’s generalization of Euler’s <t>{n) = </>i(n); 
that is, <t> a (n) denotes the number of a-tuples of (not necessarily distinct) positive 
integers not exceeding n and having a greatest common divisor relatively prime 
to ft. Then, for every fixed a , 

(107) f(p k ) « 1 - p- if /(») = *„(»)/»“, 

and /(ft) is multiplicative. This extends the definition of <t> a (n) to the case 
a 9 ^ 1, 2, • • • . Let / 3 denote the real part of a, and suppose that p > 0. Then, 
if (ii), §46 is applied as in §47, there results not only Ramanujan’s identity 
([69], p. 187), 

00 

(108) f(^ l)$Qc(ft)/ft = ^ d m = fi(wi) /(}) a - |_i(?ft), (/3 ^ 0), 

m=l 

which parallels (106), but also the fact that the trigonometric series (108) is the 
Fourier series (B) of the function which it represents. As in §47, this follows 
from Ramanujan’s identity if and only if ft > 0 is replaced by p > 1. But the 
assumption p > 1 makes the situation trivial and excludes Euler’s </>(ft), since 
the latter belongs to a: = 1 . 

49 . Actually, <f> a (n)/n cannot be uniformly almost periodic, hence the series 
(108) cannot be uniformly convergent, in Euler’s case. 

Since <t>(n)/n = 0(1), this negation cannot be proved by the argument applied 
at the end of §47. However, it is clear from the definition of G(f) in §3, that the 
uniform almost periodicity of /(ft) is sufficient for the existence of the Gaussian 
density, (?(/). But it is easily verified from elementary properties of the distri- 
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bution of primes, that the Gaussian density cannot exist for the function cj>{ri)/n 
= n(l — p -1 ), where p \ n. 

Accordingly, the necessary conditions (i)-(ii), §4 for the existence of G(f) 
are satisfied by the function f(n) = <t>(n)/n , but G(J ) does not exist, that is, a 
trivial necessary condition of uniform almost periodicity is violated. In fact, 
the non-existence of (?(/) implies that/(n) cannot even be almost periodic ( W ). 
In this connection, cf. Hartman and Wintner [43]. 

If \l/(n) denotes, as usual, the degree of the modular equation of the n-th 
Stufe (that is, the number of the transformation subgroups of order n in the 
modul group), then, according to Dedekind, <p(n)/n = n(l — p~ l ) must be 
replaced by \l/(ri)/n = n(l + p' 1 ), where p | n. Hence it is clear that the 
situation is precisely the same as in the case a = 1 of (105), and is therefore 
apparently less favorable than, but actually about the same as, in the case 
a = 1 of (107). 

50 . A case of uniform almost periodicity presents itself in what corresponds to 
coefficients of auto-correlation (cf. §41) of Euler’s <£(n) itself (instead of <t>(n)/n). 
In fact, if l is a fixed positive number, then, according to Ingham [45], 

n n— 1 

(109) ^ + l) ~ cf(l)n z and ^ — m) ~ \cf(n)ri 

771= 1 771=1 

as n — » oo , where the function /(ft) and the constant c are given by 

(110) f(n) = n (p 3 - 2p + l)/(p 3 - 2 p) and 3c = II (1 - 2 p -2 ) 

p | n p 

respectively. But it is clear from (110) that/(n) is the multiplicative function 
for which f(p k ) is 1 + (p 3 — 2 p) -1 . Hence the assumption, (103), of (iii), §46 
is satisfied. 

This example is of interest, since, as a rule, functions /(n) that are uniformly 
almost periodic are either too artificial from an arithmetic point of view or 
uniformly almost periodic practically by definition. In other words, the condi- 
tion of uniform almost periodicity happens to be too restrictive to have arith- 
metical applications of a non- trivial nature. 

51 . The genesis of this situation can well be illustrated by a function con- 
sidered by Gram [29] which, though instructive in itself, seems to have received 
just as little attention as a similar construction of Dedekind [13]. 

Gram’s purpose appears to have been (cf. his subsequent papers [30], [31]) 
an elementary model of the mysterious mechanism which leads to Riemann’s 
explicit formula in the distribution theory of primes. Incidentally, Dedekind’s 
parallel construction serves the same purpose (even though there is no evidence 
that Dedekind was aware of this; cf. Wintner [95]). Gram proves that, if 
F(n) = n — F([%n]), then F(ri)/n — » f as n — » oo , and his formulae lead him to 
suspect, and perhaps even to claim, that the remainder term F(n) — 2n/3 has a 
principal “wobble” (cf. Hardy [36], Chap. II). The following Fourier analysis 
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will imply that the resulting approximation is almost as misleading as, according 
to Littlewood, the principal wobble of Riemann’s series is. 

Corresponding to the Euclidean algorithm of dyadic fractions, successive 
application of the definition, Fin) = n — F([Jft]), gives 

00 [log n ] 

(111) F(n) = Z ( — 1) J [2~ J n] = Z ( — l) y [2 -, 'n], 

2=0 2=0 

since [2 ~ 3 n\ = 0 for every j > [log ft], where the logarithm is of base 2. A 
comparison of (111) with (19) gives 

(112) F(n) = Z /(m), 

7n=l 

if fin) denotes the function for which /'(n) is (— 1) J or 0 according as ft is or is not 
of the form 2\ where j = 0, 1 • • • . In other words, /'(ft) is the multiplicative 
function for which 

(113) f(p k ) = 0 if V * 2 and f(2 k ) = (-1)". 

Hence (39) and (40) show that /(ft) is multiplicative and 

(114) f(p k ) = 1 if p ^ 2 and f(2 2k ~ l ) = 0, /( 2 2k ) = 1. 

This means that /(ft) is 0 or 1 according as the exponent of the highest power of 
2 dividing n is odd or even (it is even if it is 0). In particular, the multiplicative 
function /(ft) is the characteristic function of a set S (cf. the beginning of §29). 
All of this is also clear from the dyadic interpretation, alluded to before (111); 
an interpretation which seems to have escaped Gram [29]. 

According to (111) and (112), 

n [ log n ] oo 

(115) Z /( 77i) = n Z (—2 _1 ) J + 0(log n) = n Z (— 2 _1 ) J + 0(log n), 

7n=l 2=0 2=0 

which, by (3), implies that M(f) exists, its value being the sum of the last series, 
that is, f. Furthermore, since (114) satisfies the assumption, (102), of (ii), 
§46, the function /(ft) is almost periodic ( B ) and is represented by its Fourier 
series; the latter results in an explicit form, if the values attained by the multi- 
plicative function /'(ft) are substituted from (113) into (104). Actually, as it 
will turn out in §52, the almost periodicity ( B ) of /(ft) has nothing to do with the 
explicit structure of the set S having the multiplicative function /(ft) as its 
characteristic function. 

It is clear from (113) that the multiplicative function /(ft) is 2-adic, if a func- 
tion /(ft) is called p-adic when /'(ft) vanishes for every ft distinct from 1, p, p 2 , 
(and possibly for some of the integers 1, p, p 2 , • • • also), where p is a fixed prime. 
According to Toeplitz [79], a p-adic function of ft is uniformly almost periodic if 
and only if its value attained at n = p k tends to a limit as k — > oo . On the 
other hand, a p-adic function of ft is almost periodic ( W ) if and only if it is 
bounded (Hartman and Wintner [43]). Thus it is clear from (114) that fin) 
cannot be uniformly almost periodic and that it is almost periodic ( W ). 
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52. Let a set, S, of positive integers be called multiplicative if its charac- 
teristic function, /(ft), is a multiplicative function. Clearly, a multiplicative set 
S is given if and only if it is known for every prime power, p k , where k = 1 , 2, • • • , 
whether n = p k is or is not in S; the value of f(p k ) being 1 in the first case and 
0 in the second case. It is understood that the decision of this alternative is an 
arbitrary function of the two variables p, k , and that ft = 1 is contained in every 
multiplicative S. 

Instances of finite multiplicative sets are the set of the square-free divisors of 
m and the set of all divisors of m, where m is arbitrarily fixed. The charac- 
teristic function of the latter set is e n (m)/n in terms of the notation (80i). The 
infinite multiplicative set consisting of those positive integers n which are rela- 
tively prime to a fixed m has the characteristic function p(ra, ft), where p(m, n) 
= p(ft, m) is the function considered in Kronecker’s Vorlesungen , pp. 246-250. 

Clearly, the assumption of (iii), §46 is satisfied by the characteristic function 
of a multiplicative set S only if the double series (103) has a finite number of 
non-vanishing terms (which implies that all but a finite number of primes must 
be in S ). It is easy to see that this trivial sufficient condition for the uniform 
almost periodicity of /(ft) is necessary as well (the reasons are similar to those 
which preclude for <p(n)/n the existence of a Gaussian density). In view of the 
example of §51, the conditions for the almost periodicity (W) of /(ft) are clearly 
more favorable, although /(ft) cannot be almost periodic (W) for an arbitrary 
multiplicative S. However, it is always almost periodic ( B ). 

In order to prove this, let a prime, p, be called a q or an r according as it is or 
is not contained in S. Two cases will be distinguished, according as 2r _1 < oo 
or 2 r _1 = oo. In the first case, the assumption of (ii), §46 is satisfied, since 
(102) is always majorized by 

| 1 — 1 |g -1 + S 1 0 — 1 |r _1 + ^ ^ k(l + 1 )p~ k = r -1 + const. 

q r p k—2 r 

Hence, in the first case /(ft) not only is almost periodic ( B ) but it also has a 

Fourier series satisfying (86) for every ft. This cannot be true in the second 

case, since if S consists of ft = 1 alone, then /(ft) is certainly almost periodic 
(B), and has the Fourier series 0 + 0 + • • • , although /(l) = 1; cf. (iii), §38. 
Actually, this is always true in the second case. In fact, it will be verified in §79 
that the mean M (/) exists and is 0 whenever 2r -1 = oo . But M (/) = 0 and 
/(ft) ^ 0 imply that /(ft) is almost periodic (B), with 0 + 0 + • • • as Fourier 
series. 

53. An extreme instance of the first case is the characteristic function, | p(ft) | , 
of the multiplicative set consisting of all square-free positive integers. Since 
| p(p) | = 1 for every p and | p(p k ) | = 0 whenever k > 1, it follows from (17) 
and (18) that | p(p k ) | 7 = 0 for every k ^ 2 and | p(p 2 ) | 7 = — 1. Hence 

| ^(ft) | 7 = M(ft"), where p(x) = 0 for x ^ [x], 



(116) 
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as in (23). Substitution of (116) into (104) gives the Fourier series ( B ) of the 
characteristic function | p (n) | . 

An extreme instance of the second case is the following dual of the preceding 
example: Let So be the multiplicative set consisting of those positive integers, 
say of 1 = hi < h 2 < • • • , which have no simple prime factor. Thus if 0 o (n) 
denotes the characteristic function of So , then 0 o (p) = 0 and 0 o (p 2 ) = 1, 
0o(p 3 ) = 1, • • • . Hence, by Euler's factorization, 

^2 h~ s = ^2 0o (n)n~ s 

(117) ' 

= II (1 + 0 + p~ 2s + p~ 3a +•••)= f (2s)f (3s)/f (6s), 

if (7 > | (cf. Wintner [94]). In fact, the value of 1 + p~ 2s + p~ 3s + • • • , being 
equal to 1 + p~ 2s /(l — P ~ s ), is identical with the reciprocal value of 

(1 - p“ 2s )( 1 ~ p~ 3s )/d - P" 6s ). 

54. For the Dirichlet generator of the multiplicative function y(n) defined by 
y (p k ) = k, Cantor [6] observed an identity of the same type as (117), namely 

(118) E y(n)n~‘ = f (a)f (2s)f (3s)/f (6s), 

where a > 1. He did not point out that 7 (n) has a simple significance, which 
can be realized as follows: 

If p, g, • • • are the distinct prime divisors of an integer n = p k q l • • • , where 
k > 0, l > 0, • • • and n ^ 1, let (a, (3 , • • • ), (X, p, • • • ), • • • denote any set of sets 

of positive integers satisfying a + 0 + • • • = k, X + /z + • • • =/,••• and a ^ 

/ 3 ^ • * • , X ^ p ^ ••*,•••. Then the description of all finite Abelian groups 
in terms of primary bases asserts that every Abelian group of order n is simply iso- 
morphic to the group of ordinary multiplication in exactly one collection {1; p“, 
P^j * * * J #\ (A * * * ; • • * } , and vice versa. But the number of all these collections 
belonging to a fixed n = p k q l • • • is (a + {$ + • • • ) • (A + /x + • • • ) • • • • , that is, 

kl • • • . Hence, the number of all Abelian groups of order n is kl • • • , if n ^ 1. 

Since it is 1 if n = 1, the multiplicative function 7 (n) defined before (118) repre- 
sents the number of the Abelian groups of order n. 

It follows that the number of all Abelian groups of order n, when considered 
as a function of n, is almost periodic (. B ) and is represented by its Fourier series 
for every n. In fact, it is clear from/(p°) = /( 1) = 1 that the assumption, (102), 
of (ii), §46 is satisfied if f(p k ) = k. On the other hand, the necessary condition 
(i), §4 for the existence of G(f) is not satisfied by / = 7. 

It is seen from (117), (118) and (25) that y' (n) = 6 0 (n). Hence, (104) shows 
that the Fourier constant, a m , of y(n) has the value 

cim ^ ^ 0o (n) J n ^ j h n , 

m | n ™\h n 



( 119 ) 
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by (117), where a > In particular, since cq = H f(n)/n is the mean, M(f), 
of / = 7, it is clear from (119) and (117) that, in accordance with Cantor’s 
identity (118), 

(120) M(y) = f(2)f(3)/r(6). 

Since 7 (n) is at least 1 for every n and becomes the larger the more composite 
n is, it might first appear paradoxical that the asymptotic average value of the 
number of the finite Abelian groups be so small a value as (120); a value which 
is barely 2, since f(2) = 7r 2 /6, f(6) = 7r 6 /945 and, according to Gram’s table 
([30], p. 269 or [32]), f(3) = 1.2020569 • • • . The paradox is cleared up by ob- 
serving that 7(71) = 1 if and only if n = i, where i denotes a square-free number; 
that y(7i) = 2 if and only if n = p 2 f, where (p, i) = 1 ; and so on. 



Part III 



THE STATISTICS OF THE PRIME NUMBER THEOREM 



The distribution of primes and Fourier constants §55— §61 

Poisson’s law and the distribution of primes §62— §70 

The statistics of the sum of two squares §71— §76 

The ergodic law of multiplicative sets §77— §85 



The Distribution of Primes and Fourier Constants 

55. In view of the examples (i)-(ii), §38, there arises the question as to the 
existence of a Fourier expansion for the function, f(n) = A (n) , on which the 
standard proofs of the prime number theorem depend; that is, for the function 
which, according to (26) and (15), can be defined by 

(121) A (n) 8 n p k log p, 

if 8 nm = 0 when n m and 8 nn = 1. It is indicated by a comparison of §41 with 
the approach of Hardy-Littlewood [38], [39] to Goldbach’s problem (cf. §59-§61 
below), that the Fourier expansion (90), if any, will be of the particular type of 
Ramanujan series (cf. §44- §45), that is, of the form 

00 

(122) A (n) ~ y a m c m (n). 

m= 1 

According to (80 3 ) and to the pair of conditions which defined (90) in §40, the 
assumption (122) means that 

(i) if X is rational, say X = l/m , where (I, m) = 1 and 1 ^ l ^ m, then the 
Fourier constant M (Ax), where 

(123) A \(n) = A (n) exp(— 27r ikn), 

exists and has a value, a m , that is independent of the $(ra) possible choices of l\ 

(ii) if X is irrational, the Fourier constant M (Ax) of (121) exists and vanishes. 
It will be shown that, in this sense, (122) can almost be proved, without 

recourse to any form of RiemamTs hypothesis. The italicized reservation refers 
to the possible violation of (ii) by a zero set of irrationals. It will remain unde- 
cided whether or not this X-set of measure 0 is actually vacuous (in this connec- 
tion, cf. Vinogradow [83], Theorem 1). 

(*) The prime number theorem , when formulated for all arithmetic progressions, 
implies (and is substantially equivalent to) the fact that the Fourier analysis (122) 
holds in the sense of (i) alone , and that the Fourier constants have the values 

(124) a m = g(m)/ 0 (m), (m = 1, 2, • • • ). 

Furthermore , (ii) is true at any rate for almost all X. 

According to (80 3 ), all frequencies \k occurring in (90) when the series (90) 
is of the Ramanujan type (122), have rational values, and are therefore taken 
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care of by (i) alone. Correspondingly, the role of (ii) is that of assuring the 
completeness of the Fourier analysis. Thus the first part of (*) is interesting in 
itself. 

The zero set of exceptional irrationals X, if any, cannot have anything to do 
with the fact that, as observed by Hardy [35] in connection with Ramanujan’s 
result quoted in (ii), §38, the series obtained by substituting (124) into (122) 
converges to <fi(n)/n times A(n), and not to A(n) itself, if* n^l. In fact, this 
effect of discontinuity (cf. §16) depends only on the illegitimacy of the rearrange- 
ments mentioned at the beginning of §38. 

56 . In order to prove the first part of the italicized statement of §55, let 

(126) e = exp( — 2iri\), where X = l/m; (l, m) = 1, 1 S l ^ m, 

l and m being fixed. Thus, from (121), 

(127) 2 e n A (w) = 2 log p + 2 e p2 log p + • • • . 

n<x p<x V 2 <x 

According to (3), (123) and (126), the assertion (i), §55 is that the expression 

(127) is of the form a m x + o(x) as x — > °o, where a m is supposed to have the 
value (124). But the number of the non-vacuous sums on the right of (127) 
is 0( log x), and it is clear from | e | = 1 that, in view of Chebyshev’s estimate 
A(l) + • • • + A([x]) = 0(x), the fc-th of these sums is majorized by a fixed mul- 
tiple of x llk . Thus the total contribution of all sums but the first is 0(logx)0(x ? ) 
= o(x). Hence it is sufficient to show that the first sum on the right of (127) 
is a m x + o(x). 

To this end, let P° denote the set of the prime divisors of m (for the value of 
m fixed above), and let all primes not contained in P° be classified into <£(m) 
subsequences Pi , P 2 , • • • in such a way that p is in Pj if and only if p = lj 
(mod ?n), where h , h , • • • represent the residue classes relatively prime to 
m. Then, since (126) is a primitive m-th root of unity, 

<p (m) 

(128) J2 log p = 0(1) + y e’ Mx), 

p<x 2=1 

where 0(1) represents the contribution of the finite class P° and &j(x) denotes the 
sum of the logarithms of those primes contained in P 3 which are less than x. 
Thus the prime number theorem of the arithmetic progressions asserts that 

* In terms of the above-mentioned parallelism between series 2 a m c m (n) and periodic 
trigonometric series on the interval 0 ^ t < 2 tt, the proviso n ^ 1 can be compared to the 
proviso t ^ 0 in/(0 = 2 a m cos mt if /(O) = 0 and, for instance, a m - x =1/ log m (or, for that 
matter, a m = l/m). In fact, it is seen from (80e) and (124) that the series (122) is reduced 
at n = 1 to 

(125) 2c w (1)m(w)/0(w) = 2 n(m) 2 /4>(m) ^ 2 1 = x, 



although A(l) = 0, by (121). 
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#j(x) ~ x/4>(m) as x — > qc holds for each of the values of j. On the other 
hand, e h , e ?2 , • • • are the </>( m ) primitive ??i-th roots of unity, and so their sum 
is ix{m). It follows therefore from (128) that the first sum on the right of (127) 
is a m x + o(x), if a m is defined by (124). 

57. The proof of (i) thus completed follows the pattern of an elementary 
result of Fa tou [25] for which, instead of the prime number theorem, only 
Mertens’ elementary formula is needed. 

If log p on the right of (127) is replaced by 1/p, then, according to Mertens, 
each of the 4>(rn) functions 3j(x) on the right of (128) must be replaced by 
(log log x)/<t>(m) + 0(1), where each of the functions 0(1) of x tends to a 
limit as x — » x. On the other hand, the term corresponding to the 0(1) in 
(128) is independent of x for large x, since it represents the contribution of a 
finite class, P°, of primes. It follows therefore from the absolute convergence 
of the double series 

ZZeA p\ (lEi//<A 

p A-=2 \ p A=2 / 

that, if e and a m are defined by (126) and (124), 

£ e p /p = a m log log x + Const. + o(l) as x — -> oc . 

p<x 

Since the value (124) is 0 only when m is not square-free, it follows that the power 
series 2 z p /p is convergent or divergent at the rational point z = e of the circle 
| z | = 1 according as the integer m defined by (126) is not or is square-free. 
This is the elementary result of Fatou. 

It is also seen that, if m is square-free, then there exists a real constant such 
that 2 z p /p diverges either to + °o + i const, or to — + i const, at the rational 

point z = e, the sign of x being the same as that of ju(m). But the rational 
points z = e satisfying (126) and either of the restrictions ju(m) = =tl lie dense 
on the circle | z j = 1. It follows therefore from a standard theorem on series 
of continuous functions, that those irrational points on | z | = 1 at which the 
power series 2 z p /p is divergent, with a divergence of the type + x + ic, form a 
set of the second category (and so, in particular, a set that is non-enumerable on 
every subinterval), and that the same is true if + + ic is replaced by — sc + ic. 

However, the power series is convergent almost everywhere on | z | =1. 
This follows for 2 z p /p in the same way as for the weirder function 2 2 ^(l°g p)/p 
in §58 below. 

It would be interesting to know whether or not the Fourier series 2 (sin pt)/p , 
which obviously is of class (L“) and represents the imaginary part of 2 z p /p 
on | z | = 1, is convergent for every t (instead of almost every t). 

58. The second part of the assertion (*), §55 has nothing to do with prime 
numbers, since it is implied by a general lemma on Fourier series of class (L 2 ). 

The lemma in question, when particularized to the case of power series, states 
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that the existence of a 8 > 0 satisfying 2 j h{n) | 2 n < oo (and, incidentally, even 
the condition 2 | h(n) | 2 log n < oo) is sufficient to guarantee the convergence 
of 2 h{n)z n almost everywhere on | z | = 1. Hence, if h(n) = A (n)/n and 
z = exp {it), an application of (4) to the function g(ri) = nh(n)z l of ft shows 
that, in order to prove the existence and the vanishing of the mean, M(f \ ), 
of (89) for almost all values of X in the case /(ft) = A(ft), it is sufficient to ascer- 
tain that 2 | A (ft)/ft | 2 n < oo holds for some 6 > 0. But is it clear from (121) 
that this condition is satisfied by every 8 < 1 . 

Incidentally, (121) implies that the sufficient conditions of W. H. Young for 
the common part of all Lebesgue classes (L 2 ), (L 3 ), • • • (and even the corre- 
sponding condition for an exponential class) are also satisfied. 

59. Suppose for a moment that the zero set of X- values, which are excluded in 
the second part of (*), §55, is vacuous (or at any rate irrelevant). Then the 
assertion of (*), §55 becomes that (122) and (124) represent for /(ft) = A (ft) 
a Fourier expansion in the sense defined after (90). Suppose in addition that 
the formal transition from (90) to (93) is legitimate in this case (actually, this 
second hypothesis, which is independent of the first, is all that is relevant in the 
sequel). Then (80 3 ) shows that, if l in (93) is replaced by ft, 

00 

(129) M(A <,1> ) = E | cu| 2 Cm(n), where a m = n(m)/<t>(m), 

m = 1 

by (124). In the case l = 0 excluded after (92), it is easily seen from (121) 
that the mean, (3), of the function (93) belonging to / = A, l = 0 does not exist, 
since 

(130) M (A 2 ) = M { | A | 2 ) = M{ A (0) ) = oo 

(on the other hand, the prime number theorem asserts that ilf(A) exists; its 
value is 1). 

For a fixed positive ft, let g{m) denote the ftz-th term of the series (129). Then 
< 7 (ra) is multiplicative, since all three functions /z(m), <t>(m), c m of m are. Fur- 
thermore, since n(p k ) = 0 if A; > 1, and n(p) = — 1, it is clear that g(p k ) = 0 if 
k > 1, and that substitution of (80 7 ), where p — 1 = <t>(p), gives g{p) = 1/0 {p) 
or g(p) = — l/0(p) 2 according as p does or does not divide the fixed integer ft. 
It follows therefore from <j>{p) = p — 1 and from 2 p~ 2 < oo, that the sufficient 
condition, (51), for the validity of (50) is satisfied. Consequently, (129) is 
equivalent to 

m (A <n> ) = n u + (p - in n a - (? - 1 r 2 }. 

p | n pj( n 

Since 1 — (2 — l) -2 = 0, the first factor of the second product vanishes for 
every odd ft. Hence it is easily verified that the last relation can be written in 
the form 



(131) 



M{ A (2n) ) = 2 Af{n), M{ A (27l_1) ) = 0, 
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if A is the positive constant 

(132) A = n {1 - (P - ir 2 } 

p> 2 

and /(n) denotes the function n p ~\. ^ 

2 < p | 2 n P 2 

(133) /(/) = (p - l)/(p - 2) if p * 2 and /( 2 k ) = 1 
and 

(133 bis) f(nm) = whenever (n, m) = 1; /( 1) = 1. 



60. The multiplicative function /(n) defined by (133) happens to be identical 
with Sylvester’s oscillatory function in Goldbach’s problem. 

In fact, if p{n) denotes the number of the representations of n as a sum of two 
primes, p and q , then the assertion of the rule of Sylvester [72], [75] is that 

(134) p(2n) ^ 2A/(n)n/log 2 n as n — » cc • p(2 n — 1) = 0. 



For reasons explained by Hardy and Littlewood ([38], [39]), Sylvester suggested, 
instead of (132), an erroneous numerical value for the constant A occurring in 
(134). The appearance of n/log 2 n in (134) is easy to guess, since, the sum 
p(l) + • • • + p(2n) being the number of pairs of primes satisfying p + q ^ 2n, 
the formulation t(x) ~ x/\og x of the prime number theorem implies that 



(135) A it p(m) =1 S t{ 2 n - p) 

zn m= 1 zll p<L2n 



log 2 n 



as n 



since p m ~ m log m as m — > oc . What is ingenious in Sylvester’s law, (134), 
is the discovery of the explicit form, (133)-(133 bis), of the delicate function /(n), 
which exhibits the “irregularities” of p{n). In view of the following theorem, 
it appears to have an historical interest that Sylvester states the function /(n) 
to be expressible in terms of infinite series involving roots of unity of arbitrarily 
high degree, and that, as a contradistinction, he refers to the reduction factor, 
(135), as non-periodic. 

(**) The function (129) of n, representing the formal auto-correlation coefficient , 
M{ A (n) ), of Chebyshev J s function (121), is connected with Sylvester’s law (134) 
by the relations (131)-(134), where the multiplicative function f(n), defined by (133), 
is almost periodic ( B ) and has a Fourier series of the form (81)-(82) which , in 
addition , converges to /(n) for every n. 



61. The situation in (**) is as follows. As shown by van der Corput [11], 
the method of Vinogradow [83] proves the truth of Goldbach’s conjecture for all 
but o(n) of the even integers not exceeding n (previously, Hardy and Littlewood 
[40] proved this under a modified form of Riemann’s hypothesis). However, 
Sylvester’s law has never been proved, not even under the assumption of the 
truth of Riemann’s hypothesis for Dirichlet’s L-series. But Vinogradow’s 



STATISTICS OF THE PRIME NUMBER THEOREM 



53 



method supplies the truth of the analogue of (134) not only for the case of 3 
primes, treated loc. cit. [83], but for the case of j primes as well, provided that 
j > 2 (previously, Hardy and Littlewood [39] proved the same under a modified 
form of Riemann’s hypothesis). 

Goldbach’s case, j = 2, has its particular difficulties in view of a coalescence 
of “major” and “minor” arcs. However, the formal “singular series ” belonging 
to j = 2 proves to be identical with that Fourier series ( B ) to which (**) refers ; a 
series which, by (80 3 ), is of the type described by Sylvester and represents, in 
accordance with his contradistinction, a function that is almost periodic ( B ). 
In view of §38, this is by no means obvious from the convergence of the trigono- 
metric series representing fin). In fact, since n(l + p~ l ) = oc and (133)- 
(133 bis) imply that f(n) ^ 0(1), it is obvious that the series (86) cannot now be 
uniformly convergent. Accordingly, the principal content of (**) is that the 
trigonometric series (86) now happens to be the Fourier series ( B ) of the function 
which it represents. 

Needless to say, not even the almost periodicity ( B ) of f(n) can be inferred 
from the convergence of the singular series. Actually, the almost periodicity 
( B ) of f(n) shows that the standard description of f(n) as the contribution of 
the “irregularities” of Goldbach’s problem is rather misleading, since f(n) . 
admits of quite a regular harmonic analysis; in this connection, cf. Wintner [91]. 

In order to prove not only the almost periodicity ( B ) of f{n) but the addi- 
tional assertion of (**), §60 as well, it is sufficient to observe that the assumption 
(102) of (ii), §46 is satisfied by (133). In fact, the series (102) is reduced to 
2 | f(j>) - 1 | /v, where | f(p) - 1 | /p ~ 1/p 2 , by (133). 

It may be mentioned that, in accordance with the end of §50, the problem of 
almost periodicity is trivial if Goldbach’s problem is replaced by the problem of 
j > 2 primes, since then the sum of the singular series is uniformly almost periodic 
by virtue of its absolutely-uniform convergence. 

The non-oscillatory factor, which is the generalization of (135) for j > 2, 
is just a reflection of the prime number theorem for every fixed j. In the next 
chapter, certain variants of the asymptotic laws of these non-oscillating factors 
will be considered. 

Poisson’s Law and the Distribution of Primes 

62. If S is any set of positive integers, let S(n) denote the number of those of 
its elements which do not exceed n. Thus, if fs(n) denotes the characteristic 
function of S, 

(136) S(n) = E 

m= 1 

The set S = So and its characteristic function fs(n) = 0o(n) considered in the 
second part of §53 represent the case l = 0 of the set S = Si and of its charac- 
teristic function fs(n) = 6i(n), if Si is defined as follows: A positive integer is in 
Si if and only if the number of its simple prime factors is exactly l. For instance, 
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1 V V 2 M, V\ are in So , Si , So , & , Si respectively, if q is a prime distinct from p. 
Since every n is in exactly one of the sets <S 0 , <Si , • • • , it is cleai t a 



(137i) 



oo 

Yj 0(0) = 1 for every n ■ 



1=0 



On the other hand, from (136), 



(137s) 



T. 6i(m ) = Si(n) for every 



(l = 0 , 1 , •••)• 



m = 1 



The bulk of the positive integers is certainly not in So . In fact, 

(139 0 ) SoOVn = 0(rT i+e ) 

holds for every , > 0, since, the ordinary Dirichlet series (117) being ; 

for a > i , the sum function (137.) belonging to l - 0 must be 0(n ) for every 

« > 0. On the other hand, if A denotes the constant (120), that is, 

(138) A = f (2)f (3)/f(6), 

then 

A (log log n) !_1 



SzO) 

(139) (Z — 1)! log n 

where l is fixed (cf. Wintner [94]). According to (139 0 ) and (139) the relative 
frequencies of the integers contained in S 0 and in any fixed S , ^ S 0 deviate 
almost by the order of »“*. Incidentally, the estimate (139.) can be refined to 
the asymptotic formula 

( 240 ) SoO^/n ' — ' const, n const. — f(^)/r(3), 

which, in view of (138), may be imagined to be the limiting case of (139). 

63. In order to verify these asymptotic laws, let R, denote the set of those 
square-free positive integers which are composed of exactly pnme ac ois. 
For instance, Ro consists of the single integer 1, and Ri of all primes. It was 
apparently known already to Gauss [28] that the truth of the asymptotic formula 



if 1^0, 



(n oo), 



(141) 



Ri(n) (log log n) 



l - 1 



if 1^0, 



{n 



0 



(l — 1)! log n’ 

is implied by its truth in the particular case 1 = 1, that is, by the prime number 
theorem. But (139) can be reduced to (141). 

In fact if l ^ 0 is fixed, it is clear from the definitions of Si and Ri that a 
positive integer is in S, if and only if it is a product of the form ij, where * is 
contained in , say * = p. • • • and i has only mulUple prime facto- each 
of which is distinct from all of the l distinct primes pi , • , Pj ■ ^nce, m 

order to prove that (141) implies (139), where l is fixed, it is sufficient to s o 
that s/i _1 = A, where h runs through the sequence of all those positive integeis 
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none of which possesses any simple prime factor. In other words, it is sufficient 
to show that, if 1 = hi < h? < • • • is the sequence defined before (117), then the 
series 2 /z n -1 converges to the value (138). But this follows by placing $ = 1 
in (117), where a > 

This proves (139). In the limiting case, l = 0, the following refinement of 
(140) is of the same depth as the prime number theorem, that is, as (139) itself. 

64 . Let s be replaced by s — J in the identity (117), which is valid for a > 
Then, if 

(142) g(n) = n h e 0 {n), 

(117) appears in the form 2 g(n)n~ s = rj(s)£(2s — 1), where a > 1 and 

(143) v (s) = f(3 8 - f)/f(6 « - 3). 

But 2 g{n)n~ s = rj(s)£(2s — 1) means, by (25), that 

(144) E g'(n)n~ s = v(s)?(2s - l)/f(s). 

Since the Dirichlet series of the function (143) is absolutely convergent at s = 1, 
and since the Dirichlet series of the function f (2s — l)/f (s) was seen to be con- 
vergent at s = 1 (§14), it follows from the multiplication theorem of Stieltjes, 
that the Dirichlet series (144) is convergent at s = 1. According to Dirichlet's 
analogue of Abel's continuity theorem, the value represented by the series (144) 
at s = 1 can be obtained by letting s — > 1 + 0 in (144). Then f (2s — l)/f (s) — > \ 
and so, from (143) and (144), 

(145) ^g'(n)/ n = i const., where const. = f(f)/f(3). 

The relation (140) is an elementary corollary of the variant (145) of the prime 
number theorem (and can be proved elementarily, cf. Erdos and Szekeres [21], 
where an 0-estimate of the remainder of (140), but not of course the correspond- 
ing o-estimate resulting from (145), is obtained; the numerical value of the con- 
stant occurring in (140), a value which also follows from an identity of Cantor 
[6], is loc. cit. [21] not given in the above form). 

First, since the function (142) is real and non-negative, the convergence of 
2 g'(n)/n implies, by (IX 2 ), the existence of M (g). But (I) shows that M(g) has 
precisely the value (145). Thus, if (142) is substituted into (6), 

n n 

m*0o(ra) = \ const . n + o(ri); whence 23 Oo(m) = const.n' + o(n*), 

m=l ■ m= 1 

by partial summation. According to (136), the last relation is identical with 
(140). 

65 . The characteristic functions 0 o (n), Oi(ri), • • • of the sets So , Si , • • • intro- 
duced in §62 define a classification of all positive integers into a sequence of 
mutually disjoint subsets, S m . A dual of the corresponding asymptotic prob- 
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lem results if the characteristic functions 0 o (n), 0i(n), ■ • • are replaced by the 
functions ft (n), ^ 2 (n), • • • which are defined as follows: p m (ri) is the non-negative 
integer representing the number of those positive integers among the expo- 
nents a, b, • • • of n = p a q • • • which happen to have the value m, where it is 
understood that p, q, • • • are distinct primes and that p m (l) = 0. In other 
words, fim(n) denotes the number 0) of the m-fold prime factors of n, where 
m = 1, 2, • • • . Thus (15) shows that, for every fixed m, 

(146) Pm'(p k ) = hnjn and = 0 unless n = p k , (Jc = 1, 2, • • • ), 

where the notation is the same as in (121). 

If c o(n) denotes, as in §36, the number of all prime divisors of n, then obviously 

00 

(147) o o(n) = 22 Pm(n) for every n. 

m=l 

• 

It is easily verified from (146) and (147) that everything that was proved in §36 
for the function fin) = c o(n) — v(n ) holds for the function /(n) = co(n) — fii(n) 
also. In particular, the latter function is almost periodic ( B ); so that, since 
M (co) does not exist, M (fa) cannot exist. 

The situation is quite different if m = 1 is replaced by an arbitrary m ^ 1. 
In fact, although p m (n) ^ 0(1) as n — > co holds for every m, the function p m {ri) 
of n is almost periodic ( B ), with a Fourier series ( B ) representing fi m {ri) for 
every n, if m ^ 1. This follows by observing that, according to (146), 

22 d(n)p m ' in) /n = 22 d(p m )m/p m = (m + 1 )m 22 l/v™, 

n = 1 p p 

and so the assumption of (XVIII) is satisfied unless m = 1. 

Almost periodicity of any kind is more than sufficient for the existence of an 
asymptotic distribution function (as to the latter notion, cf. Jessen and Wintner 
[48], Hartman, van Kampen and Wintner [44], Erdos and Wintner [22]). Since 
p m (n) is capable of the values 0, 1, 2, • • • only, it follows that there exist limits 
a 0 m , cti m , * • • satisfying 

00 

(148) lim Si m (n)/n = a™ and 22 a™ = 1, if m 1, 

n— >oo 1=0 

where S™ denotes the set of those positive integers n for which /3 m (n) = l , and 
Si”\n) is defined by (136). 

According to the definition of p m (n), the function pi(n) represents the number 
of the simple prime factors of n. Hence the preceding definition shows that 

(149) Si 1 = Si for l = 0, 1, 2, ••• , 

where Si is the set defined after (136). Thus (139) and (140) represent what 
corresponds to (148) in the case m = 1. 

66. The interpretation (149) of Si leads to a general lemma, which may be 
formulated as follows (cf. Wintner [94]) : 
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Let T be any set of positive integers such that there exists a positive integer 
l = l T for which (i) no integer contained in any of the sets Si+i, S i+2 , • • • is in 
T and (ii) an integer contained in Si is or is not in T according as it is or is not 
square-free. Then, in terms of the notation (136), 



(150) 

where 



T(n) 

n 



(A(n)r 1 xoo 

(fel! 



as n 



(151) \{n) = log log n. 

In order to prove this, let L(n) be an abbreviation for the expression on the 
right of (150), where l is fixed. Then, from (139o) and (139), 

(152) S 0 (n)/n + Si(n)/n + • • • + Si-i(n)/n = o(L(n)). 

Those integers contained in T for which neither (i) nor (ii) assumes anything 
are contained in the set S 0 + Si + • • • + Si-i , and so their relative frequency 
is majorized by (152). On the other hand, the relative frequency of those 
integers contained in T which are contained in the set $z+i + Si+ 2 + • • • is 0, 
by assumption (i). Hence, in order to prove (150), it is sufficient to show that 
the relative frequency of those integers contained in T which are contained in 
Si is L(n) + o(L(n)). But since Si consists of the positive integers having 
exactly l simple prime factors, assumption (ii) can be expressed by saying that 
an integer contained in S 1 is contained in T if and only if it is square-free. Con- 
sequently, the assertion is that the relative frequency of the integers composed 
of exactly l distinct primes is L(n) + o(L(n)), where L(n) denotes the function 
on the right of (150). Since (151) shows that the expression on the right of 
(141) is identical with L(n), and since Ri(n)/n denotes precisely the relative 
frequency just mentioned, it follows from (141) that the assertion is true. This 
proves (150). 



67. Corresponding to the definition, (37), of the class of multiplicative func- 
tions, a function f(n) is called additive if 

(153) /(W 1 W 2 ) = f(ni) + f(n 2 ) whenever (fti , n 2 ) = 1. 

Since this means that 

(154) /( 1) = 0 and f(n) = f(p k ) + f(q J ) + • • • if n = p k q J • • • , 

where p, q, • • • are distinct primes, it is clear from (15) that/(n) is additive if 
and only if 

(155) f'(n) = 0 unless n = p k , (k = 1, 2, • • • ; p = 2, 3, • • • ), 

as illustrated by (121) and (146). Accordingly, an additive function is uniquely 
determined by an arbitrary assignment of a double sequence of values f(p k ). 
Since 

(156) 



/( 1) = 0, hence /'( 1) = 0 
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in virtue of (18), it is clear from (17) that such an arbitrary assignment is equiva- 
lent to an arbitrary assignment of the values 

(157) /'(p) = f(p), /'(p 2 ) = /(p 2 ) - /(p), /'(p 8 ) = /(p 3 ) - /(p 2 ), • • • 

for every p. 

For every fixed prime q and for every additive function/ = f(n), let f q = f q (n) 
denote the additive function assigned by the values 

(158) f q (p k ) = 8 pq f(p k ), (k = 1, 2, • • • ; p = 2, 3, • • • ), 

where 5 pq = 0 if p ^ q and 8 qq = 1 . Thus it is clear that, corresponding to the 
representation (43), (45) of a multiplicative function, 

(159) /(ft) = X fp( n ) for every ft 

V 

(it being understood that the infinite sum (159) has only a finite number of 
non-vanishing terms for every fixed ft). 

It is known that, if / (ft) is additive, then 

(I) /(ft) has an asymptotic distribution function if and only if both series 
2/*(p)/p, 2 | f*(p) | 2 /p are convergent, where f*(p) denotes /(p) or 1 according 
as |/(p) | < 1 or |/(p) | ^ 1 (Erdos and Wintner [22]); 

(II) /(ft) is almost periodic ( B ) if and only if all four series 

\^f(v) xy y^ 1 f(p h ) 1 y^ I f(p) 1 y^ 1 f*(v) [ 2 

p V ’ ^=2 P p k ’ |/(^l p ’ V P 

are convergent (Hartman and Wintner [42]); a criterion obtained by an adapta- 
tion of the proof of the more elegant result according to which 

(III) f(n) is almost periodic (B 2 ) if and only if both series 2 f(p)/p , 
22 | f(p k ) | 2 /p k , where k = 1, 2, • • • , are convergent (Erdos and Wintner [23]); 

(IV) f(n) is almost periodic (IF) if and only if /(ft) = 0(1) as n — * co (Hart- 
man and Wintner [43]); 

(V) /(ft) is uniformly almost periodic if and only if the series (159) is uni- 
formly convergent for n = 1, 2, • • • (this follows from the criterion of Toeplitz 
[79], quoted at the end of §51; cf. van Kampen [50]). 

These criteria, the first three of which lie quite deep (even though they do 
not involve the prime number theorem), and none of which has an analogue in 
case of an arbitrary function /(ft), will not be proved here; they are not needed 
in the sequel. 

68. For an arbitrary function /(ft) and for a given value a, let F a denote the 
set of those positive integers n = n a which is defined by 

(160) F a : f(n a ) = a . 

Thus, if F a (n) is the function (136) belonging to S = F a , then F a {n)/n is the 
relative frequency of the value a among the ft values /(l), • • • , /(ft). In par- 
ticular, the set F a is vacuous unless /(ft) attains the value a for some ft. 
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In view of (156), an additive function will be called positive if f{n) > 0 for 
every n ^ 1 . This means that F a is vacuous unless a ^ 0 and that 

(161) F 0 {n) = 1 for every n; 

or, what by (154) is the same thing, that each of the values f{p k ) is positive. 

It will now be shown that the lemma of §66 implies the following theorem 
(Wintner [90], [94]): 

(t) If fin) is a positive additive function satisfying f{p) = 1 for every prime p, 
then, no matter what the {positive) values f{p 2 ), f{p s ), • • • assigned for p = 2, 3, • • • 
may be , the asymptotic distribution of f{n) over the sets Fi , F 2 , • • • obeys Poisson’s 
statistical law of independent rare events *, that is, 



(162) 



Fi(n) 

n 



(A(n)r 1 -xin) 
(7-1)! 



as n 



{l = 1 , 2 , ••■), 



where X(n), the squared standard deviation {variance), is log log n. 

The occurrence of the function (151) in (162) is a manifestation of Mertens’ 
elementary result, used in §57. On the other hand, (162) depends on the prime 
number theorem (which it contains for l = 1). 

Since the values/^) assigned for k > 1 need not be integers, it is clear that 
what corresponds to (137i) is 



(163) 



1 + X Fi{n) + F*{n) = n 



for every n, 



where the first term represents the contribution, (161), of the set F 0 , and F* 
denotes the set of those positive integers at which the value attained by / is not 
an integer. However, it is easy to adapt the proof given in §66 so as to yield the 
estimate 



(164) 



F*(n) = o{n); hence ^ Fi{n)/n — > 1 



holds in virtue of (163). Since, from (151), 



(105, 



= E 

z=o 



(log log n) 1 1 _ e ] 



,log log n 



l\ 



log n log n 



= 1 , 



it follows that the sequence of the asymptotic laws (162) represents a case of 
complete additivity . 

It is by no means clear that this must be the case. In fact, the situation is 
quite different for either of the classifications represented by (141) and (139)- 
(140); classifications in which the sum of the elementary approximations is 1 and 
A respectively. In fact, the classification (141) refers to all square-free integers, 
and the latter are known to have the asymptotic relative frequency l/f(2), 
which is distinct from 1. On the other hand, the classification considered by 



Cf., e.g., G. Darmois, Statistique Mathematique, Paris, 1928, p. 84. 
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(139)- (140) enumerates the set of all positive integers, and the latter have the 
frequency 1, which, by (138), is distinct from A. 

Incidentally, (138)- (139) also show that the assumptions, f(p) = 1 and 
f(p k ) > 0, of (162) are so essential that (162) becomes false if only/(p) = 1 
and/(p fc ) ^ 0 are assumed. In fact, if/(n) = (3i(n), where /h (n) is the number 
of the simple prime factors of n, then the set (160) belonging to a = l is precisely 
the set (149) to which (138)-(139) refer. Nevertheless, f(n) = (3i(n ) is a non- 
negative additive function satisfying/(p) = 1 (however, /(p 2 ) =/(p 3 ) = • • • = 0). 
Needless to say, the assumption f(p) = 1 of (162) is only a normalization of the 
assumption /(p) = const. > 0. If const. = 0, it is illustrated by (148) how 
radically is the situation changed. 

69. The proof of the Poissonian theorem (f), §68 proceeds as follows: 

Let l be a fixed positive integer. Since /(p) = 1 and/(p fc ) > 0 by assumption, 
it is clear from (154) that f(n) is equal to or greater than the number of the 
simple prime factors of n, according as n is or is not square-free. Hence, if S m 
denotes the set of those positive integers having exactly m simple prime factors 
(a set containing not only square-free integers), then (i) no n contained in 
S i+ i , S 1 + 2 , • • • can satisfy /(n) = l and (ii) an n contained in Si satisfies f(n) = l 
if and only if it is square-free. Hence both assumptions, (i)-(ii), of §66 are 
fulfilled by the set T consisting of those positive integers n for which f(n) = l , 
where l is a fixed positive integer. Since T = Ft in virtue of (160), it follows 
from (150) that the proof of (162) is complete. 

70. It is worth pointing out the common formal background of (t), §68 and 
of (/)-(F), §67. 

Let gi(n), • ■ • , g m (n) be m periodic functions of n, and suppose that their 
periods, say ki , • • • , k m , are relatively prime (in the narrower sense of the term, 
that is, in the sense that ( ki , k 3 ) = 1 unless i = j). Then, since the product 
k = ki • • • k m is a common period of gi , • • • , g m , a straightforward counting 
(or, equivalently, a verification of “ergodicity” with reference to an “incompres- 
sible flow”) shows that 

k m m kj m 

(166) Yj n (s 'i(n)) Ti = ns (gi(n)) ri , where k = II , 

71=1 2=1 2=1 n = l 2=1 

is an identity in the m non-negative integers r\ , • • • , r m . If the functions are 
real-valued, then (166) means that gi(n), • • • , g m (n ) are statistically independent 
(in the sense in which this term has always been used*). The complications 
arising in case of complex-valued functions are only typographical in nature. 

Now let/(n) be an additive function and suppose, for a moment, that there 
exists for every p a sufficiently large m = m p so that f'(p m+1 ) = f'(p m+ 2 ) = • • • 0. 
Then it is clear from (15) and (158) that each of the additive functions f p (n ) 

* Cf., e.g., p. 194 of Darmois’s book, referred to in the preceding footnote, or A. Kolmo- 
gorofTs Grundbegriffe der Wahrscheinlichkeitsrechnung, Berlin, 1933, p. 50. 
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occurring in (159) is periodic; the primitive period of f p (n) being if m = m v 
has its least value. It follows that the functions gi(n) = f p (n ), <72 (w) = f q (n), 
representing the various terms of the series (159) have periods ki , k 2 , • • • 
satisfying the assumption of (166). 

If m = nip does not exist, f P (n) is not strictly periodic. However, the sta- 
tistical independence still holds in terms of asymptotic distributions (cf . Wintner 
[88]; Hartman, van Ivampen and Wintner [44]); as a matter of fact, it holds 
(cf. Hartman and Wintner [43], §3) even in the so-called unrestricted sense 
(cf. Jessen and Wintner [48], §11). 

Incidentally, all of this can be extended to the more general case indicated at 
the end of the footnote to (21). 

The Statistics of the Sum of Two Squares 

71. Let g(n) denote the function for which the function g'(n), defined by (15), 
attains the value ( — l) 5(n_1) or 0 according as n is odd or even. Then obviously 

(167) g( 2 k ) = 1, g(q k ) = k + 1, g(r k ) = \ + §(-1)* 
for k = 1,2, • • • , if q and r denote those primes for which 

(168) q = 1 (mod 4) and r = 3 (mod 4) 

respectively (it is understood that J + §( — 1)* in (167) denotes 1 or 0 according 
as k is even or odd). 

Since the function g'(n) is multiplicative, the same is true, by (39), of the 
function gin). If g'(n) is replaced by/'(n) = 4 g'(n), then the sum (15) becomes 
the classical formula for the number, r 2 (n), of the representations of n as a sum 
of two squares. In other words, g{n) denotes one quarter of the number of those 
points in a planar square lattice of unit width which have the distance ri* from 
the origin (0, 0) of the lattice. 

In accordance with the notation (160), let all positive integers n be classified 
into a sequence of mutually disjoint sets Go , Gi , • • • , by placing n into Gi if and 
only if g{n) = l. Since gin) is a multiplicative function, it is clear from (167) 
that every “urn” Gi contains an infinity of integers. 

The majority of the integers n are in Go , in the sense that, in terms of the 
notation (136), 

(169) G 0 (n)/n -a 1 as n — > <*> 

(for a sharper result, cf. (170)-(171) below). If y(x) denotes the asymptotic 
relative frequency (“probability”) of the inequality gin) < x, where x is a fixed 
real number, then (169) is equivalent to the statement that y(x) = 0 for — co < 
x < 0 and y (x) = 1 for 0 < x < . This implies that gin) cannot be almost 

periodic (£). For if it were, Mig) ought to be represented by the first moment 
of the asymptotic distribution function, 7, of g , that is, by the Stieltjes integral 
J xdyix) over the range — °o < x < co . Since this integral is 0, there results 
a contradiction with the evaluation M(g) = 7r/4, obtained in §7 as a consequence 
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of (29 2 ). [However, if a m denotes (— l) 5m_ V/rn or 0 according as m is odd or 
even, then the trigonometric series (86) represents f(n) = 4 gin) for every n 
(Ramanujan [69], p. 190; the convergence of the series is substantially equivalent 
to the prime number theorem) .] 

72 . There arises the question as to the asymptotic law (if any) of the relative 
frequencies, Gi(n)/n , determined by the distribution of all positive integers over 
the various “urns” Gi , where l = 0, 1, • • • . The answer,* which will be given 
completely, turns out to be quite involved. In view of everything that is known 
on questions of this type, it is not surprising that the problem happens to depend 
on the machinery of the prime number theorem. What makes the situation 
involved is not this methodical circumstance but rather the fact that the order 
of magnitude of the relative frequency Gi(n)/n for large n and fixed l depends on 
the arithmetical structure of l in an intricate manner, instead of being dependent, 
as in the Poissonian law (f), §68, on the mere size of 1. Correspondingly, the 
result to be obtained disposes not only of the crude mechanism considered by 
Tricomi [81] but also of the possibility of the statistical model subsequently 
suggested by Levy [61]. 

Clearly, the analogue of (163) is 

(170) G 0 (n) + G x (n ) = n, if <T(n) = E Gi(n), 

1=1 

that is, if G°° denotes the set, Gi + G 2 + • • • , of those positive integers which can 
be represented in at least one way as a sum of two squares. It is known that 
(169) can be refined to 

(171) CT(»)/» - C“(log «)"*, ( V = g ft) , 

where C°° (as every C in the sequel) denotes a positive numerical constant 
(Landau [57]; cf. also Watson [84], Wintner [93]). The question formulated 
before refers to a corresponding asymptotic relation (if any) for each of the 
terms of the infinite sum (170). 

It will be shown that 

(172) Gi(n)/n Ci(n log n )~ 2 

(which, in view of (171), implies that the first term of the infinite sum (170) is 
of a much lower order than the sum itself) ; that 

(173) G 2 (n)/n ~ C 2 (log n) _1 

* The question can be interpreted in terms of the distribution of the characteristic values 
of a vibrating quadratic membrane (cf., e.g., Riemann-Hattendorff, Partielle Differential- 
gleichungen, 2nd ed. (1876), pp. 250-258), but the answer can hardly have an acoustical 
interest. 
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(so that the second term is almost of the same order as the sum itself) ; that 

(174) Gs(n)/n ~ C 3 (n log n)~ log log n 

(so that the third term is of almost the same order as the first) ; that 

(175) <?,(»)/» ~ Ci{n log n) _i if l = p k > 3, (ft = 1, 2, • • • ), 

(so that l = 2 and l = 3 are the only prime powers for which the order of Gi(n) 
is not the same as that of Gi(n)/n); finally, that one of the three possibilities 

(176) Gi(n) ~ const. G\(n ); X = 1, 2, 3, (const. > 0) 

must take place for every positive integer l (the actual value of X in (176) is an 
arithmetical function of l). 

73. Let gi(n) denote the characteristic function of the set Gi . Then, corre- 
sponding to (136), 

(177) Gi(n) = E gi(m). 

m = 1 

The generating Dirichlet series is 

(178) <l>i(s) = E gi(n)n~‘. 

It will be necessary to consider the function (178) in its dependence on the 
arithmetical structure of l. 

In view of (170) and (171), it can be assumed that l ^ 0, that is, that n is in 
one of the sets G \ , (?2 , • • • . This disposes of all but those integers n which are 
of the form 

(179) n = 2 A nyny‘> 

where h, j, i .denote non-negative integers and q, r represent the two classes 
(mod 4) of odd primes, as defined by (168). In fact, since g(n) is a multiplica- 
tive function, it is clear from (167) that (179) is necessary and sufficient for 
gin) 0, that is, for an n not contained in Go . 

The coefficient gi(ri) of (178) is 1 or 0 according as g(n) = l does or does not 
hold for the multiplicative function g{n) defined by (167). Hence it is clear 
from (167) and (179) that (178) is reduced for l = 1 to 

(180) 4>iis) = E(2*IUT s , (? > 1). 

Next, if l has any fixed value which is a prime power, say l = p k , where k = 1, 
2, • • • , then it is clear that the multiplicative function g{n) defined by (167) 
satisfies g (n) = l only if the first product in the factorization (179) of n is reduced 
to a single factor, q 3 ; and that this necessary condition becomes sufficient if 
and only if the exponent of q 3 is restricted to have the fixed value assigned by 
j = l — 1. This means that (178) is reduced to 

4>i(s) = E (2V/ _1 n rT\ if l = V- 



64 



ERATOSTHENIAN AVERAGES 



Hence, if co(s) is an abbreviation for 

(181) «(«) = E <f\ (o- > 1), 

it is seen from (180) that 

(182) 4>i(s) = 0i(s)«([Z - l]s), if l = v- 

It is clear from this deduction that, if l is the product of two prime powers, 
say of a and of b , where (a, b) = 1, then (182) must be replaced by 

(183) * ab (s) = fc(«) E qT (a ~V- 1)s , 

where the summation ranges over all distinct pairs of distinct indices qi , g 2 . 
It is understood that a q always denotes a prime satisfying the first of the 
conditions (168). 

Finally, if l = abc • • • , where a, b, c, • • • are powers of distinct primes, then 
results if the double sum on the right of (183) is replaced by the corre- 
sponding multiple sum. 

74. Clearly, the Dirichlet series (180) has the Euler factorization 

Ms) = e 2~ hs n e r~ 2is = a - 2-r 1 n a - o -1 . 

h—0 r i=0 r 

Hence, from f (s) = II (1 — p _s ) _1 , where p = 2, q, r in the notations of (168), 

(184) <Ms) = (1 + 2 _s )f (2 s)t7(2s), 
where, as an abbreviation, 

(185) v(s) = n a - <n, (fl > 1). 

Thus the behavior of all the Dirichlet series (178) on the boundaries of their 
respective half-planes of convergence is substantially reduced to the behavior 
of f (s) and of rj(s) on the line a = 1. 

It is known from the theory of the L-series of Dirichlet that, if q runs through 
all primes satisfying q = k (mod m), where ( k , m) = 1, then the contribution of 

(185) to the behavior of l/r(s) = 11(1 — p~ s ) on the line a = 1 is proportional 
to the ratio a = 1 :0(m); in the sense that the function (s — l)~ a r}(s) is regular 
and distinct from 0 on the line a = 1 (for sharper results, cf. Watson [84]). 
Since q = 1 (mod 4) in the present case, it follows from 0(4) = 2 that 

(186) t ]($) = c 0 (s — 1 Y + p 0 (s) for cr ^ 1, 
and so, from (184), that 

(187) 0i(s) = Ci(s — + pi(s) for a ^ J, 

where, as always in the sequel, every c is a positive constant and every p(s) de- 
notes a function satisfying the following conditions: It is regular in the interior 
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of the half-plane specified and possesses a continuous boundary function. For 
instance, from (181) and (185), 

(188) co(s) = — log 7] (s) + p(s) for a ^ 1. 

75. It is known that Ikehara’s theorem can be extended as follows (cf. Wint- 
ner [93]): 

If a Dirichlet series 2/(n)n _s , where /(n) ^ 0, converges for a > 1 and is 
such that there exist two positive constants, B and 0, for which the function 
Xf(n)n~ s — B/(s — 1)^ acquires continuous boundary values on the line <7 = 1, 
then 

n 

(189) Z/(m) const. n/(log n) 1 ^ as n— * oo , where const. = 5/r(0). 

m— 1 < 

The logarithm disappears in Ikehara’s case, where 0=1. 

Since every gi(n) in (178) is either 1 or 0, it follows from (187) that the case 
0 = \ of the theorem is applicable to the Dirichlet series of the function 0i (s + J). 
In fact, the replacement of s by s — \ translates the line a = \ of (187) into the 
line cr = 1 of the theorem, and transforms the coefficient gi(n) into gi(n)n 5 , a 
coefficient which is again real and non-negative. Accordingly, 

n 

(190) X) gi(jn)m ? ~ const, (n / log n)*, (const. > 0). 

m= 1 

It is seen from (177) by partial summation that (190) completes the proof of 

(172). 

In the case l = p k of (182), it is clear from (188) that, unless l — 1 ^ 2, the 
“nearest” singularity of the second factor of (182) lies beyond the critical line, 
a = J, of the first factor, (187). Hence, (175) follows in exactly the same way 
as (172). Thus, if l = p k , only the cases l — 1 ^ 2, that is l = 2 and l = 3, 
remain to be treated. 

If l = 2, the second factor on the right of (182) is the function (188). Hence 
it is clear from (186) and (187) that (182) is now of the form 

(191) <j) 2 (s) = — c 2 log(s — 1) + p 2 (s) for <7^1. 

Consequently, (173) follows by comparing the case l = 2 of (177) and (178) with 

JZ 1 n / log n and 1 -p~ s = log f(s) + p(s), where <7^1, 

p < n p 

and observing that the analytic behavior of log f (s) on the line <7 = 1 is the same 
as that of the function (191). 

If l = 3, then (188), (187), (186) and (182) show that (191) must be replaced by 

(192) 03 (s + i) = — c 3 (s - h)~ h log(s - i) + p 3 (s) for <7 ^ j. 

It follows that 

n 

(193) ^ gz(m)m* ~ const . (n log log n)/(log n)\ 

m= 1 



(const. > 0). 
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In fact, the order assigned by (192) for the case l = 3 of (178) is an Abelian 
consequence of (193). On the other hand, the Tauberian transition from (192) 
to (193) follows from the general theorem which results by rewriting, for the 
case of the singularity of (192), the general theorem used in the Tauberian 
transition from (187) to (190). Correspondingly, the partial summation 
leading from (190) to (172) leads from (193) to (174). 

This completes the treatment of the cases l = 1 and l = p k . 

76 . Suppose now that l = ab , where a and b are of the form p k , and (a, b) = 1. 
Then <#q(s) is given by (183). But it is seen from (181) that (183) is substantially 
the same function as 

(194) <£i(s)co([a - l]s)co([6 - l]s) ; 
the necessary additive correction being just the function 

(195) E = w ( [a _|_ b _ 2 ]s), 

the effect of which can be appraised as follows: 

First, a + b ^ 2 + 3, since a and b are either distinct primes or powers of 
such primes. Hence it is clear from (181) that the additive correction, (195), 
is regular in the half-plane 3<r > 1. Since this half-plane contains not only the 
half-plane a ^ 1 but the half-plane a ^ J as well, it follows from (187) and 
(188) that the relevant “nearest singularities” of (183) can always be replaced 
by those of (194). 

As in the case, l = p k , of §75, three subcases of the present case, l = ab, will 
have to be distinguished. 

The least possible value of a is 2. Suppose first that a = 2 in l = ab. Then 
6^3. Hence it is clear from (187) and (188) that the analytic behavior of 
(194), and therefore that of (183), on the critical line is the same as that of 
(191). Consequently, (176) holds for X = 2. 

Next, suppose that a = 3 in l = ab. Thus, since the case a = 3, b = 2 is 
identical with the case a = 2, b = 3, considered before, it can be assumed that 
b = p k is at least 4. Hence, for the same reasons as in the preceding case, it 
now follows that (176) holds for X = 3. 

Finally, the same reasoning shows that (176) holds for X = 1 in the remaining 
case of the assumption l = ab, that is, in case a and b are powers of two distinct 
primes both of which exceed 3. 

Since all of this can be transcribed from l = ab to the general case l = abc • • • , 
described at the end of §73, the proof of the assertions of §72 is complete. 

The Ergodic Law of Multiplicative Sets 

77 . Let a set, S, of positive integers be called completely multiplicative if it 
not only is multiplicative in the sense of §52 but is such as to contain the product 
of any two of its elements, even if the latter are not relatively prime (or distinct) . 
Thus S is uniquely determined by its multiplicative basis, that is, by the se- 
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quence, say S f , of the primes contained in S. Conversely, any given sequence 
of primes generates a completely multiplicative set. 

The prime number theorem and the results connected with it succeed, to 
some extent, in analyzing the unknown set S ^ in terms of the given set S when 
the latter is the sequence of all positive integers. The success of this analysis 
depends on that esoteric state of saturation which compels the set S to be equi- 
distant when is given as the sequence of “all” primes. In fact, the elementary 
approximations of Chebyshev and Mertens to the rough order of magnitude of 
the m-th prime involve, via Stirling’s theorem, the explicit assumption that S 
be equidistant, and supply explicit statements as to the asymptotic distribu- 
tion of S. 

However, there exist less explicit forms of the prime number theorem; formu- 
lations which do not refer at all to the order of magnitude of the m-th prime. 
For instance, it is known that the prime number theorem is equivalent to the 
assertion that the mean, M(p), of the Mobius function, n(n), exists; (Landau 
[58]; cf. Hardy and Littlewood [37]). Since the value of M(p) is 0, and since 
n(n) is capable only of the values 0 and ±1, the prime number theorem thus 
appears in an “ergodic” form, stating that there exists an asymptotic probability, 
i, that n(n) be positive, and therefore the same probability that pin) be nega- 
tive, if the trivial frequency of the n-values for which pin) vanishes is subtracted. 

The aim of this chapter is to prove that a corresponding formulation of the 
prime number theorem holds in case of any sequence of primes. In view of the 
exceptional set of measure 0, excluded by the law of large numbers or, more 
generally, by Birkhoff’s ergodic theorem (an exceptional set which is of the 
second category in the usual realization of the underlying product measure), it 
is of course a purely arithmetical fact that the proviso of the “almost all” can 
be replaced by the assertion that what corresponds to the exceptional set is 
now vacuous. 

The results will be such as to hold for arbitrary multiplicative sets, and not 
only for the completely multiplicative sets defined above. 

78 . With reference to a given multiplicative set, S , let the sequence, P, of all 
primes be subdivided into two subsequences, say Q and P, by placing a prime 
into Q or into R according as it is or is not contained in S. In order words, if 
/ ( n ) denotes the multiplicative function representing the characteristic function 
of S , let 

(196) p = q when f(p) = 1 and p = r when f(p) = 0 , 

where q, r and p denote arbitrary elements of Q, R and P respectively. If 
k > 1, then both f(q k ) and/(r fc ) can be either 1 or 0 (depending on k and p). 
Since /(l) = 1, it follows from (17) that 

(197) /V) = -1,0, 1, 
holds in both cases, p = q and p = r, and that, if A; = 1, 

(198) f'(q) = 0, f'(r) = — 1. 



(fc = 1,2,..-), 
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Since /(n) is a multiplicative function, (39) and (197) imply that 



(199) 

It also follows that 



fin) = -1,0, 1, 



(n = 1, 2, • • • )• 



(200) £ X) \f(V k ) \/v k < 00 , where f(p k ) = f(p k ) -/(p* *). 



oo 



A;=2 p 



79. Correspondingly, the following elementary alternative is a mere restate- 
ment of the standard limiting form of the sieve of Eratosthenes: 

(i) If fin) is the characteristic function of a multiplicative set , S , then M(J ) ex- 
f$£s. Furthermore , df(/) = 0 or 0 < ilf (/) ^ 1 according as the sum of the reci- 
procal values of all primes not contained in S is divergent or convergent ; in fact , 
(42) ZioZds in both cases. 

Suppose first that the sum mentioned in (i) is convergent. This means, by 
(196), that 2 r -1 < oo. It follows therefore from (198) that (200) remains 
valid if the lower limit, k = 2, of the exterior summation is replaced by A; = 1 . 
Hence the case g(n) = \f'(n)/n \ of (51) shows that 2 | f'(n) \/n < oo. This 
implies that (50) is true for g(n) = f'(n)/n , and that M(f) exists in virtue of the 
remark following (VIII), §20. Thus (42) is true. Finally, (42) and (196), 
where 2 r -1 < oo by assumption, imply that M(f) > 0, since every f(p k ) is 
either 0 or 1 . 

It is also seen that, if 2 r~ l = oo , the product on the right of (42) is 0. Hence, 
in order to complete the proof of (i), it is sufficient to show that M(f) exists and 
vanishes when 2 r~ l = oo . 

To this end, let S m denote, for every positive integer m , the multiplicative set 
which, in terms of its characteristic function, f m (n), is defined as follows: If 
p is a q, then f m (p k ) = f(p k ), and if p is an r, then f m (p k ) = f(p k ) or f m (p k ) = 1 
according as r < m or r ^ m, where it is understood that k = 1, 2, • • • in all 
three cases, and that q and r refer to the disjunction (196). It is clear from this 
definition that the relations 



hold for every n = p k , and therefore for every positive integer n. It is also 
clear that f m (q) = 1 for every q and that f m (r) = 0 or /m(r) = 1 according as 
r < m or r ^ m. 

In particular, S m contains all but a finite number of primes. Since the recip- 
rocal values of the excluded primes form a finite sum, it follows from what has 
already been proved that M(f m ) exists and is represented by the product (42) 
belonging to f m {n). Hence it is seen from the values f m (p k ) defining the function 



f m (and, in particular, from the values of f m (q) and f m (r) pointed out before), 
that, since f{p k ) = f(p k ) - f{p k ~ l ), 



( 201 ) 



fm(n) ^ and lim f m (n ) = f(n ) 
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where C > 0 denotes the value of the product which results if p is replaced by 
q on the right of (42). Since C is independent of m, it follows from (199) that 

M(f m ) = C II (l - - + O E -A as m -> oo 

r <m ( T k=2 T J 

where the 0 refers to r — > oo . Hence the assumption 2r -1 = °o implies that 
M ( f m ) — » 0 as m — > oo . 

Finally, the limit relation (201) can be written in the form / = fi + (fz — fi) 
+ • • • which, by (2), implies that N(f) ^ A(/i) + N(fz — fi) + ■ • • . But the 
terms of the latter series are non-negative, and its ra-th partial sum is 
since | f m — f m - i 1 = fm- i — fm, by the inequality (201); cf. (2) and (3). Conse- 
quently, N{f) ^ M(fm) for every m. It follows therefore from M(f m ) — > 0 
that N(f) = 0. Hence it is seen from (2) and (3) that M(f) exists and vanishes 

80 . A curious corollary, which is still elementary, may be formulated as 
follows: 

(ii) If ds(n) denotes the number of those divisors of n which are contained in a 
multiplicative set , S, then 

(202) - X d s (m) — S ' — — * — r'(2)| S | as n — > oo , 

n m= i m = i m 

where the accent of the second summation refers to the omission of those summation 
indices not contained in S, and | S | is the asymptotic relative frequency (“density”) 
of S. 

In other words, | S | = M(g), if g(n) denotes the characteristic function of S. 
Thus the norm (2) is finite, since g(n ) = 0(1). Furthermore, M(g) exists, by 
(i), §79. Hence the assumptions of (VI), §20 are satisfied by the function 
f(n) for which /'(n) = g(n). Since (15) shows that this/(n) is precisely d s (n), 
the proof of (ii) is complete. 

It is seen from the formula following (29i) that the determination of the order 
of the remainder term of the limit relation (202) is Dirichlet’s divisor problem, 
if S is the set of all positive integers. 

81 . In his theory of closed cycles (that is, of multiplicative sets of positive 
integers), Cesaro ([10], pp. 146-147) has asserted it to be evident that the series 
2 f'(n)/n is convergent whenever /(n) is the characteristic function of a multi- 
plicative set (it is clear from the context that Cesaro’s argument is based on 
(30)-(31), §8). However, if this were really evident, the prime number theorem 
would be trivial indeed, since it would follow from the definition (22), where 
f(n) is the characteristic function of the multiplicative set consisting of a single 
integer, 1. In view of (III) and (IV), it is somewhat surprising that Cesaro’s 
final result turns out to be correct after all: 

(iii) If f(n) is the characteristic function of a multiplicative set , S, then the series 
2f'(n)/n is convergent. Furthermore , 2 f'(n)/n = 0 or 0 < 2f'(n)/n ^ 1 accord - 
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ing as the sum of the reciprocal values of all primes not contained in S is divergent 
or convergent ; in fact, (41) holds in both cases. 

Corollary. The mean M(f') exists and is 0 in both cases. 

In fact, /'(n) = 0(1), by (199). Hence the first part of (iii) follows from the 
first part of (i) in virtue of (X). Correspondingly, the second part of (iii) is 
implied by (I) and by the second part of (i). 

Finally, the assertion of the Corollary follows from (4) and from the first part 
of (iii). Incidentally, (VI) implies the validity of the converse inference. 

82. The preceding result contains the answer to a question which, according 
to oral communications, has repeatedly been raised and often considered (al- 
though it does not seem to have been formulated in the literature). The diffi- 
culty in answering the question is that, as will be seen in §83, the classical route 
of the analytic theory of primes, followed after Riemann by Hadamard-de la 
Vall6e-Poussin and Landau-Ikehara, is now blocked by function- theoretical 
obstacles. 

Let P = Q + R be a disjunction of the sequence, P = {p}, of all primes into 
two complementary subsequences, Q = {g} and R = {r}. Let p R {n) denote the 
Mobius function belonging to R. By this is meant that p R (n) denotes the multi- 
plicative function for which p R (p k ) is —1 or 0 according as neither or either of 
the conditions p = r, k = 1 is violated. Accordingly, the generating Dirichlet 
series is 

(203) l/f*(«) = E if f *(a) = n (1 - 0 " 1 « E O'*)" 8 

(cr > 1), where the indices n, r and r* respectively run through all positive in- 
tegers, through the given prime sequence R = {r} and through the sequence, 
P* = {r* } , of those positive integers which are not divisible by any prime, q , 
contained in Q = P — R. 

Correspondingly, Q* = {g*} will denote the sequence of the positive integers 
generated by the prime sequence Q = {#} . Thus 1 is the only integer common 
to P* and Q *. Needless to say, Q* + R* is not the set, P* = (Q + P)*, of all 
positive integers (unless either Q = P or R = P, which is the case of the classical 
functions p, f). It is also clear that, in the notations of §77, the set S = Q* is 
the completely multiplicative set generated by the basis = Q. In other 
words, if f(n) denotes the characteristic function of S = Q*, then 

(196 bis) f(p k ) = 1 if p = q and f(p k ) = 0 if p = r, 

where k = 1, 2, • • • in both cases. Consequently, not only (198) but also 

(198 bis) f'ict) = 0, /'(r*) = 0, where k > 1, 

follows from (17). This, when compared with the values p R (p k ) assigned before 
(203), shows that f'(p k ) = p R (p k ) holds for k = 1, 2, • • • in both cases, p = q 
and p = r. Since /(n) and p R (n) are multiplicative functions, it follows from 
(39) that/'(n) = p R (n) holds for every positive integer n. 
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Consequently, that particular case of (iii) in which S is completely multiplica- 
tive, can be formulated as follows: 

(iv) If p R (n) denotes the Mobius function belonging to an arbitrary subsequence , 
R = { r } , of the sequence of all primes , then the series 2 p R (n)/n is convergent . 
Furthermore , 2 p R (n)/n = 0 or 0 < 2 p R (n)/n ^ 1 according as 2 r -1 = oo or 
2 r 1 < oo ; in fact , the reciprocal value of the product (203) at s = 1 represents the 
value of l/f fl (l) = 2 p R (n)/n in both cases. 

Corollary. The mean M(p R ) exists and is 0 in both cases. 

Incidentally, the more general theorem, (iii), is readily seen to be implied 
by (iv). 

83. The function-theoretical situation referred to at the beginning of §82 
can conveniently be described in terms of Ikehara’s theorem. This theorem 
(Wiener [85]; cf. also Karamata [54], Ingham [46], Beurling [4]), when formulated 
for ordinary Dirichlet series 2 f'(n)n~ s which converge in the half-plane a > 1, 
asserts that, if there exists a constant, c, for which the function 

(204) 4>(s) = Z/'(n)n- 8 - c/{s - 1), (a > 1) 

* 

becomes uniformly continuous in a rectangle —T<t< T, 1 < a < 2, where 
s = < 7 + it, then M(f') exists, and has the value c, whenever the following pair 
of Tauberian conditions is satisfied: (i) T can be chosen arbitrarily large and 
(ii) f'(n) = 0 L ( 1) as n — » °o. 

If f'(n) is given either by (26) or by (22), then (ii) is satisfied, 2 f'(n)n~ s is 
either — d log £(s)/ds or l/f(s), and so, if c = 1 and c = 0 respectively, (i) is 
reduced to f (1 + it) ^ 0, — oo < t < °c , in both cases, where f (s) is Riemann’s 
zeta-function. Hence, Ikehara’s theorem supplies the prime number theorem 
not only in the form M (A) = 1 but also in the form M(p) = 0 (usually derived 
not directly but as a consequence of ilf(A) = 1). However, it is easy to see 
that the method fails in the case of §82. 

In fact, let f'(n) = p R (n). Then, since M(p R ) should vanish, c is 0, and so 
(204) and (203) imply that </>(s) = l/f fl (s). Furthermore, (ii) is satisfied, since 
| p R (n) | ^ 1. Finally, (i) requires that 1/f R (s) should go over into a continuous 
boundary function on the line a = 1 . Since the latter requirement is satisfied 
when either R = P or Q = P, it is clearly satisfied also in the contiguous cases, 
represented by prime sequences {r} = R = P — Q which are either so “dense” 
or so “thin” on P that, for instance, 2 g" 1 < co or 2 r -1 < °o respectively (in the 
precise sense of the theory of product measures, the probability of either of 
such choices of R is zero, since Q + R = P and 2 p~ l = oo ). 

However, in case of an arbitrary R, any continuity in the boundary behavior 
of l/f*;(s) (or, for that matter, of £ R (s) itself) is doubtful, since the product 
(203) might acquire a highly pathological natural boundary on the line a = 1. 
It is true that, by the proof of Ikehara’s theorem, the assumption of a continuous 
boundary function can readily be replaced by the assumption that </>(s) has a. 
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majorized L-integrable boundary behavior; that is, by the assumption that 
there exists for every T > 0 an L-integrable function \p(t), —T<t< T, satis- 
fying | <f>(s) | < \l/(t) for — T < t < T, 1 < a < 2. But if R is unspecified, the 
existence of such majorants is about as problematic as that of a continuous 
boundary function. In fact, very serious difficulties seem to arise if an existence 
proof for something like an L-integrable majorant is attempted not even for 
every T > 0 but merely for some T = e (or, for that matter, for to — e < t < 
to + e, where to need not be 0; even if (iv), §82 is granted, all that follows from 
general theorems is that, for every fixed real to , 

(205) l/f«(s) = o(\ s — So | -1 ) as s — > so 

holds when s tends to So = 1 + ito within a Stolz wedge) . 

84 . Accordingly, it was essential to approach (iv), §82 via (X) or (IXi). 
Naturally, the proof of the implication, (L) — » ( A ), of Hardy and Littlewood, 
an implication used in the proof of (IXi), depends on the prime number theorem 
itself (that is, on the convergence of 2 y{n)/n in the classical case, R = P), and 
even on somewhat more. In fact, it depends on the convergence of the Dirichlet 
series of the Z-th derivative of Riemann’s l/f(s) at the point s = 1 if l = 2, or 
at any rate if l = 1 + e (the convergence of this series is known for every l , and 
is needed for l = 1 + e in order to ascertain the satisfaction of what corresponds 
to (iii), §10 in case of the linear transformation which sends the Lambertian 
average into the Abelian one; cf. [37]). However, (iv), §82 appears to be the 
first instance of a theorem showing that the real Lambertian generator (59) is 
actually superior to the complex Dirichletian generator (25). Thus Wiener’s 
direct proof [85] of the particular (Tauberian) case/'(n) = 0 L ( 1) of the general 
(Abelian) implication (L) — ► ( A ) of Hardy and Littlewood, a particular case 
which would have sufficed in the proof of (IXi), acquires a methodical interest. 

Needless to say, the difficulties mentioned in §83 go deeper than what corre- 
sponds to the non-vanishing of Riemann’s f (s) on the line a = 1. In fact, the 
problem involves not only the boundary behavior of 1/f «(s) but that of f R (s) as 
well. This is seen by observing that, in a certain respect, f R (s) and 1/f R (s) are 
equivalent functions, since R can be replaced by Q = P — R in (203). Actually, 
the appearance of must then introduce some further complications. 

85 . A situation which can more easily be kept under “explicit” control but is 

delicate enough to illustrate the difficulties involved, results if the class of 
generalizations, of the Mobius function, a class defined before (203), is 

modified as follows: 

For a fixed angle 6 , let ve(ri) denote the multiplicative function attaining for 
n = p k the values 

Mu(p) = e 6 , 



(206) 



m«(p 2 ) = Mp 3 ) = • • • 0, 
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where p is any prime. For instance, the real-valued functions ne(ri) are p v (n) = 
n(n) and vo(n) = | n(ri) |. The generating Dirichlet series is 

(207) E ne(n)n~ s = l/f*(s), 
if fe(s) is defined by 

(208) 1/M«) = II (1 + e*p-), 

V 

where a > 1. It is also seen that, if co(n) denotes the number of all (or, for that 
matter, of all distinct) prime divisors of n, then 

(209) w(») = | **(») |e iMn \ 

| n(n) | being 1 or 0 according as n is or is not square-free. 

Let a = 1, that is, s = 1 + it. Then, since 2 | e t6 /p s | 2 = 2 p ~ 2 is a convergent 
series, (206) shows that (V), §17 is applicable to g(n) = ii Q {n)n~ s for every fixed 
t. Accordingly, the series (207) and the product (208) either both converge or 
both diverge for a fixed t, and they represent the same value in case of con- 
vergence. Furthermore, the product converges at s = 1 + it if and only if 
2 e d p~\ or simply 2 p~ l ~ l \ is convergent. On the other hand, Mertens has 
shown in 1887 (<1896) that 2 p~ x ~ lt is convergent for every ^ 0 at which 
Riemann's f (1 + it) does not vanish; in fact, the convergence of 2 p~ l ~ lt is uni- 
form on any closed bounded interval consisting of such values t ^ 0. Since 
f (1 + it) 5 * 0 for every i , it follows that the series (207) converges to a continuous 
function of the real variable t on both open half-lines 0< | £ | < <*> , a = 1. 
Hence, by a standard extension of Abel's continuity theorem for Dirichlet series, 
the function l/f 0 (s), defined by (207) for a > 1, is uniformly continuous on 
both open rectangles e < \ t \ < T, 1 < cr < 2, where e > 0 and T > e are arbi- 
trary. 

In view of the first item of (IV), §15, the last conclusion could not have been 
based on (208). On the other hand, the proof had to start with (208), since it 
was only this start that made possible the reduction of the case of an arbitrary 6 
to the classical case, 6 = w. This warrants the somewhat circuitous nature of 
the above arrangement of the proof for the existence of a continuous boundary 
function along both half-lines 0 < | t | < °o, a = 1. 

Now let (207) be identified, for a fixed 6 , with the series 2 f'(n)n~ s occur- 
ring in (204). Then the assumption (ii ) , made after (204), is satisfied, since 
| ixe(ri) | ^ 1, by (209). Furthermore, if c in (204) is chosen to be 0, the assump- 
tion (i), made after (204), is satisfied if and only if the value of 6 is such as to 
make the function l/fe(s) uniformly continuous on that portion of an immediate 
vicinity of the point s = 1 that is contained in the half-plane a > 1 . Actually, 
this requirement of continuity can be replaced by the assumption of an L-integ- 
rable majorant (in the sense of §83) near t = 0. The latter assumption is cer- 
tainly satisfied if, instead of the case s 0 = 1 of (205), an estimate 

(210) 1 /Us) =o(\s - 1 r 1 ); s -> 1, o- > 1, (6 = e 0 > 0} 

holds uniformly for all directions satisfying | arg z | < ^7r, where z — s — 1, 
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Accordingly, if a 6 is such that (210) holds (uniformly in arg z) for a suffi- 
ciently small positive e = ee (or even only such that a corresponding f logarith- 
mic 7 7 condition is satisfied when ee = 0), then the mean M(n e ) of the function 
Vein) exists and has the assumed value, c = 0. 

If 0/tt is rational, the value of c in (204) in the cas ef'(n) = i ue(n) can be calcu- 
lated from the theory of L-series (cf. §56— §57) . For instance, c = M(ne) is 
0 if d = 7r but it is 6/7 r 2 if 6 = 0. 



APPENDIX 



For the reader’s convenience, it seems to be appropriate to append here the 
proof of the fundamental implication, (L) — * (A), of Hardy and Littlewood [37]. 
The notation becomes simpler if their proof is rewritten for the case of Stieltjes 
integrals. The theorem then states that, if a(x) is a function for which the 
integrals 



L(s) = jf rzr-s* da (x), A (s) = jf e' 8 * da(x) 

exist absolutely* whenever s > 0, and if sL(s) tends to a limit, say C, as s — » 0, 
then sA(s) C (cf. §19, where r = e~ s in the present notation; the Lambert se- 
ries (59) and the corresponding power series 2 f'(n)r n result by choosing 
a(x) in L(s) and A(s) to be the step function consisting of the jumps a(n + 0) — 
a(n — 0) = f'(n), where n = 1, 2, • • • ). 

It can be assumed that a(x) is constant near x = 0. It is then obvious that 
the functions L(s), A(s) and their derivatives tend exponentially to 0, as s — > oc . 
Furthermore, it can be assumed without loss of generality that (7 = 0. The 
assertion then is that A (s) = o(l/s) is implied by L(s) = o(l/s), as s —> 0. 

Even if L(s) = o(l/s) is not assumed, 

nCO 0O 00 />0O 

L(s) = xe~ sx E e~ nsx da(x) = £ / xe~ snx da(x) 

JO n = 0 n = 1 Jo 

for every s > 0 (the term-by-term integration is justified by the exponential 
estimate, mentioned before). Since the last integral is identical with — A' (ns), 
where A '(s) = dA(s)/ds, 

OO 00 

L(s) = — z A'(ns); whence A'(s) = — ^ ix(n)L(ns) 

n = 1 n=l 

follows by Mobius inversion (cf. §25), the legitimacy of this inversion being 
trivial from the exponential estimates. Accordingly, since A(o o ) = 0, 

/ OO 00 00 /»00 

ii(n)L(nt) dt = g(n) / L(nt) dt. 

n = 1 7i=l J s 

This can be written in the form 

A(s) = [ L{t) dt = ["If L{t) dt) d\ix), 

n = 1 n J ns Jo (Isa: J 

* The full force of this assumption will not be needed. 
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if X(x) denotes the [x]-th partial sum of the series 2 n (n)/n. Since X(x) remains 
bounded as x — > oo and vanishes near x = 0, partial integration gives 



AW = f 

Jo 



X(x/s)L(x) dx , 



if x is replaced by x/s. 

The prime number theorem asserts that the series 2 n (n) /n converges to 0, 
which means that X(x) = o(l) as x — > oo. Actually, it is known that X(x) = 
o(l/log a x) holds not only for a = 0 but for every a , and therefore for some 
a > 1 (cf. §84). Since L(x) tends exponentially to 0 as x — > °o , this estimate of 
\{x) obviously implies that the contribution of the range 1 < x < °o to the last 
integral is o(l/s) as s —> 0. 

Hence, in order to prove that A (s) = o(l/s), it is sufficient to show that 

f X(x/s)L(x) dx = [ + [ = o(l/s) as s— >0. 

Jo Jo Je 

But the truth of this estimate is readily inferred from X(x) = o(l/log a x), x — » co f 
where a > 1, and from the assumption L(x ) = o( 1/x), x — > 0, which was not 
used thus far. 
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